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Preface 


Since the seminal works of Black-Scholes-Merton in 1973, the world of finance has been 
revolutionized by the emergence of a new field known as financial engineering. On the one 
hand, markets (foreign exchange, interest rate, commodities, etc.) have become more volatile, 
which creates an increase in the demand for derivatives products (options, forwards, futures, 
swaps, hybrids and exotics, and credit derivatives to name a few) used to measure, control, 
and manage risks, as well as to speculate and take advantage of arbitrage opportunities. 

On the other hand, technological advances have enabled financial institutions and other mar- 
kets players to create, price and launch new products and services to not only hedge against 
risks, but also to generate revenues from these risks. In addition to a deep grasp of advanced 
financial theories, the design, analysis and development of these complex products and finan- 
cial services, or financial engineering, necessitate a mastering of sophisticated mathematics, 
statistics and numerical computations. 

By way of an integrated approach, the object of this book is to teach the reader: 


© to apply stochastic calculus and simulation techniques to solve financial problems; 
è to develop and/or adapt the existing contingent claims models to support financial engineer- 
ing platforms and applications. 


There are several books in the market covering stochastic calculus and Monte Carlo simulations 
in finance. These books can be roughly grouped into two categories: introductory or advanced. 
Unfortunately, the books at the introductory level do not answer the needs of upper-level 
undergraduate and graduate students and finance professionals and practitioners. Advanced 
books, being very sophisticated and specialized, are tailored for researchers and users with 
solid and esoteric scientific backgrounds in mathematics and statistics. Furthermore, these 
books are often biased towards the research interests of the authors, hence their scope is 
narrowed and their applications in finance limited. By and large, the existing books are less 
suitable for day-to-day use which is why there is a need for a book that can be used equally by 
beginners and established researchers wishing to acquire an adequate knowledge of stochastic 
processes and simulation techniques and to learn how to formulate and solve problems in 
finance. 

This book, which has developed from the master programme in financial engineering at 
Laval University in Canada first offered in 1999, aims to reinforce several aspects of simulation 
techniques and their applications in finance. Building on an integrated approach, the book 


xiv Preface 


provides a pedagogical treatment of the material for senior undergrad and graduate students as 
well as professionals working in risk management and financial engineering. While initiating 
students into basic concepts, it covers current up-to-date problems. It is written in a clear, 
concise and rigorous pedagogical language, which widens accessibility to a larger audience 
without sacrificing mathematical rigor. By way of a gradual learning of existing theories and 
new developments, our goal is also to provide an approach to help the reader follow the relevant 
literature which continually expands at a rapid pace. 

This book is intended for students in business, economics, actuarial sciences, computer 
sciences, general sciences, and engineering, programmers and practitioners in financial, in- 
vestment/asset and risk management industries. The prerequisites for the book are some 
familiarity in linear algebra, differential calculus and programming. 

The book introduces and trains users in the formulation and resolution of financial problems. 
As exercises, it provides computer programs for use with the practical examples, exercises and 
case studies, which give the reader specific recipes for solving problems involving stochastic 
processes in finance. The programming language is the MATLAB®! software which is easy 
to learn and popular among professionals and practitioners. Moreover, the programs could be 
readily converted for use with the platform C++. Note that, unlike the MATLAB financial 
toolboxes which are still limited in scope, our proposed exercises and case studies tackle the 
complex problems encountered routinely in finance. 

Overall, the general philosophy of the book can be summarized as follows: 


¢ keep mathematical rigor by minimizing abstracts and unnecessary jargon; 
e each concept, either in finance or in computation, leads to algorithms and is illustrated by 
concrete examples in finance. 


Therefore, after they are discussed, the topics are presented in algorithmic forms. Furthermore, 
some of the examples which treat current financial problems are expounded in case studies, 
enabling students to better comprehend the underlying financial theory and related quantitative 
methods. 

Every effort has been made to structure the chapters in a logical and coherent manner, with 
a clear thread and linkage between the chapters which is not apparent in most existing books. 
Each chapter has been written with regard to the following four principles: pedagogy, rigor, 
relevance and application. Advanced readers can skip the chapters they are familiar with and 
go straight to those of interest. 

The book starts with a refresher of basic probability and statistics which underpin random 
processes and computer simulation techniques introduced later. Most of the developed tools 
are used later to study computational problems of derivative products and risk management. 
The text is divided into the following four major parts. The first part (Chapters 1 to 3) reviews 
basic probability and statistics principles. The second part (Chapters 4 to 6) introduces the 
Monte Carlo and Quasi Monte Carlo simulations topics and techniques. In addition to the 
other commonly used variance reduction techniques, we introduce the quadratic resampling 
technique of Barraquand (1995) to obtain the prescribed distribution characteristics of the 
simulated samples, which is important to improve the quality of the simulations. We also 
present the Markov Chain Monte Carlo (MCMC) and important sampling methods. The third 
part (Chapters 7 and 8) treats random processes, stochastic calculus, Brownian bridges, jump 


1 MATLAB is a registered trademark of The MathWorks, Inc. For more information, see htt://www.mathworks.com. 


Preface XV 


processes and stochastic differential equations. Finally, the fourth part (Chapters 9 to 15) 
develops the applications in finance. 

To price contingent claims, two equivalent approaches are used in finance: the state variables 
approach consisting of solving partial differential equations and the probabilistic or equiv- 
alent martingale approach. The equivalence between the two approaches is established via the 
Feynman-Kac theorem. Our purpose is to teach how to solve numerically stochastic differen- 
tial equations using Monte Carlo simulations, which essentially constitutes the pedagogical 
contribution of our book. 

The fourth part of the book presents different applications of stochastic processes and simu- 
lation techniques to solve problems frequently encountered in finance. This part is structured 
as follows. Chapter 9 lays the foundation to price and replicate contingent claims. Chapter 
10 prices European, American and other complex and exotic options using Monte Carlo 
simulations. Chapter 11 presents modern continuous-time models of the term structure of 
interest rates and the pricing of interest rate derivatives. Chapters 12 and 13 develop valuation 
models of corporate securities and credit risk. Chapters 14 and 15 overview risk management 
and develop estimations of Value at Risk (VaR) by combining Monte Carlo and Quasi Monte 
Carlo simulations with Principal Components Analysis. 

Although this is an introductory and pedagogical book, nonetheless, in Chapter 10 we 
explain many useful and modern simulation techniques such as the Least-Squares Method 
(LSM) of Longstaff and Schwartz (2001) and the dynamic programming with Stratified State 
Aggregation of Barraquand and Martineau (1995) to price American options, the extreme 
value simulation technique proposed by El Babsiri and Noel (1998) to price exotic options 
and the Retrieval of Volatility Method proposed by Cvitanic, Goukassian and Zapatero (2002) 
to estimate the option sensitivity coefficients or hedge ratios (the Greeks). Note that, to our 
knowledge, with the exception of LSM, this is the first book to bring to the fore these important 
techniques. In Chapter 11 on term structure of interest rates modeling and pricing of interest 
rate derivatives, we present the interest rate model of Heath, Jarrow and Morton (1992) and the 
industry-standard Market Model of Brace, Gatarek and Musiela (2001). An extensive treatment 
of corporate securities valuation and credit risk based on the structural approach of Merton 
(1974) is presented in chapter 12. Chapter 13 gives case studies on financial guarantees to 
show how the simulations techniques can be implemented, and this chapter is inspired from the 
research publications of the authors. As such, Chapters 12 and 13 provide indispensable funda- 
mentals for a reader to embark on the study of structured products design and credit derivatives. 

To perform a sound simulation experiment, one has to undertake roughly the following 
three steps: (1) modeling of the problem to be studied, (2) calibration/estimation of the model 
parameters, and (3) backtesting using real data and recalibration. This book focuses on the use 
of Monte Carlo and Quasi Monte Carlo simulations in finance for the sake of pricing and risk 
management assuming the dynamics of the underlying variables are known. 

We do not pretend that the book provides complete coverage of all topics and issues; future 
editions would include application examples of the Markov Chain Monte Carlo (MCMC) 
simulation technique, estimation techniques of the parameters of the diffusion processes and 
the determination of the assets variance-covariance matrix, the spectral analysis, real options, 
volatility derivatives, etc. 

We would like to thank Pete Baker from Wiley, our copy-editor Rachael Wilkie and anony- 
mous reviewers for their very constructive comments which help improve this book. We also 
thank Noél Amenc, Didier Cossin, Rose-Anne Dana, Catherine Hersent, Lionel Martellini, 
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Thierry Roncalli and Yves Simon for their various contributions and comments during the 
gestation of this book. We reserve a special thank-you to Yves Langlois and Jean-François 
Carter for their valuable assistance. We extend our gratitude to André Gascon (Associate Dean 
Academics), Michel Gendron (Head of the Department of Finance and Insurance), Maurice 
Gosselin (former Associate Dean Research), Fayez Boctor (current Associate Dean Research) 
and Robert Mantha (Dean), all from the Faculty of Business Administration of Laval Univer- 
sity, for their continual support and financial contribution to the production of the book. 
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l 
Introduction to Probability 


Since financial markets are very volatile, in order to model financial variables we need to char- 
acterize randomness. Therefore, to study financial phenomena, we have to use probabilities. 

Once defined, we will see how to use probabilities to describe the evolution of random 
parameters that we later call random processes. The key step here is the quantitative construc- 
tion of the events’ probabilities. First, one must define the events and then the probabilities 
associated to these events. This is the objective of this first chapter. 


1.1 INTUITIVE EXPLANATION 
1.1.1 Frequencies 


Here is an example to illustrate the notion of relative frequency. We toss a dice N times and 
observe the outcomes. We suppose that the 6 faces are identified by letters A, B, C, D, E and F. 
We are interested in the probability of obtaining face A. For that purpose, we count the number 
of times that face A appears and denote it by n(A). This number represents the frequency of 
appearance of face A. 

Intuitively, we see that the division of the number of times that face A appears, n(A), by the 
total number N of throws, D, is a fraction that represents the probability of obtaining face 
A each time that we toss the dice. In the first series of experiments when we toss the dice N 
times we get nı (A) and if we repeat this series of experiments another time by tossing it again 
N times, we obtain n(A) of outcomes A. 

It is likely that nı (A) and n2(A) are different. The fractions ms) and are then different. 
Therefore, how can we say that this fraction quantifies the probability of obtaining face A? To 
find an answer, we need to continue the experiment. Even if the fractions are different, when 
the number N of throws becomes very large, we observe that these two fractions converge to 
the same value of L. 

Intuitively, this fraction measures the probability of obtaining face A, and when N is large, 
this fraction goes to i. Thus, each time we toss the dice, it is natural to take z as the probability 
of obtaining face A. 

Later, we will see that from the law of large numbers these fractions converge to this limit. 
This limit, E, corresponds to the concept of the ratio of the number of favorable cases over the 
total number of cases. 


n(A) 
N 


1.1.2 Number of Favorable Cases Over The Total Number of Cases 


When we toss a dice, there is a total of 6 possible outcomes, {1, 2, 3, 4, 5, 6}, corresponding 
to the letters on faces {A, B, C, D, E, F }. If we wish to obtain face A and we have only one 
such case, then the probability of getting face A is quantified by the fraction H. However, we 
may be interested in the event {“the observed face is even” }. What does this mean? The even 
face can be 2, 4 or 6. Each time that one of these three faces appears, we have a realization of 
the event {“the observed face is even”}. This means that when we toss a dice, the total number 
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of possible cases is always 6 and the number of favorable cases associated to even events is 3. 
Therefore, the probability of obtaining an even face is simply 2 and intuitively this appears to 
be correct. 

From this consideration, in the following section we construct in an axiomatic way the 
mechanics of what is happening. However, we must first establish what is an event, and then 
we must define the probabilities associated with an event. 


1.22 AXIOMATIC DEFINITION 


Let’s define an universe in which we can embed all these intuitive considerations in an 
axiomatic way. 


1.2.1 Random Experiment 


A random experiment is an experiment in which we cannot precisely predict the outcome. 
Each result obtained from this experiment is random a priori (before the realization of the 
experiment). Each of these results is called a simple event. This means that each time that we 
realize this experiment we can obtain only one simple event. Further we say that all simple 
events are exclusive. 


Example 2.1 Tossing a dice is a random experiment because before the toss, we cannot 
exactly predict the future result. The face that is shown can be 1, 2, 3, 4, 5 or 6. Each of 
these results is thus a simple event. All these 6 simple events are mutually exclusive. 


We denote by Q the set of all simple events. The number of elements in Q can be finite, 
countably infinite, uncountably infinite, etc. The example with the dice corresponds to the first 
case (the case of a “finite number of results”, Q = {1, 2, 3, 4, 5, 6}). 


Example 2.2 We count the number of phone calls to one center during one hour. The 
number of calls can be 0, 1, 2, 3, etc. up to infinity. An infinite number of calls is evidently 
an event that will never occur. However, to consider it in the theoretical development 
allows us to build useful models in a relatively simple fashion. This phone calls example 
corresponds to the countably infinite case (Q = { 0, 1, 2, 3,..., co}). 


Example 2.3 When we throw a marble on the floor of a room, the position on which the 
marble will stop is a simple event of the experiment. However, the number of simple events 
is infinite and uncountable. It corresponds to the set of all points on the floor. 


Building a probability theory for the case of finite experiments is relatively easy, the gener- 
alization to the countably infinite case is straightforward. However, the uncountably infinite 
case is different. We will point out these differences and technicalities but we will not dwell 
on the complex mathematical aspects. 


1.2.2 Event 


We consider the experiment of a dice toss. We want to study the “even face” event. This event 
happens when the face shown is even, that is, one of 2, 4, or 6. 
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Thus, we can say that this event “even face” contains three simple events {2, 4, 6}. This 
brings us to the definition: 


Definition 2.4 Let Q be the set of simple events of a given random experiment. Q is called 
the sample space or the universe. An event is simply a sub-set of Q. 


Is any subset of Q an event? This question will be answered below. We must not forget that 
an event occurs if the realized simple event belongs to this event. 


1.2.3 Algebra of Events 


We saw that an event is a subset of Q. We would like to construct events from Q. Let Q be the 
universe and let £ be the set of events we are interested in. We consider the set of all events. € 
is an algebra of events if the following axioms are satisfied: 


Al: Qeé, 

A2: VA € &, AS = Q\A e€ é (where Q\A, called the complementary of A, is the set of all 
elements of Q which do not belong to A), 

A3: WA, A2,...,An EE, A UA2U...U A, € &. 


Axiom A1 says that the universe is an event. This event is certain since it happens each time 
that we undertake the experiment. Axiom A1 and axiom A2 imply that the empty set, denoted 
by Ø, is also an event but it is impossible since it never happens. Axiom A3 says that the union 
of a finite number of events is also an event. To be able to build an algebra of events associated 
with a random experiment encompassing a countable infinity of simple events, axiom A3 will 
be replaced by: 

A3’: UPS, A, =A, UAU... UA U... EE. 
This algebra of events plays a very important role in the construction of the probability of 
events. The probabilities that we derive should follow the intuition developed previously. 


1.2.4 Probability Axioms 


Let Q be the universe associated with a given random experiment on which we build the 
algebra of events £. We associate to each event A € & a probability noted Prob(A), representing 
the probability of event A occurring when we realize the experiment. From our intuitive setup, 
this probability must satisfy the following axioms: 


P1: Prob(Q) = 1, 

P2: VAeé,0< Prob(A) < 1, 

P3: if A,,A2,...,An,... is a series of mutually exclusive events, that is: Vi Æ j, AINA; = Ø, 
then 


n=1 


Prob(U%.,A,) = ) Prob(A,). (1.1) 


n=1 
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Axiom P3 is called o —additivity of probabilities. This axiom allows us to consider random 


experiments with an infinity of possible outcomes. From these axioms, we can see that 


Prob(@) =0 and Prob(A‘) = 1 — Prob(A) (1.2) 
which are intuitively true. 
A very important property easy to derive is presented below. 
Property 2.5 Consider two events A and B, then 
Prob(A U B) = Prob(A) + Prob(B) — Prob(A N B). (1.3) 
The mathematical proof is immediate. 
Proof: Let A \ C be the event built from elements of A that do not belong to C. 
A=(A\C)UC where C=AN B. (1.4) 
Since A \ C and C are disjoint, from axiom P3, 
Prob(A) = Prob(A \ C) + Prob(C). (1.5) 
Similarly 
Prob(B) = Prob(B \ C) + Prob(C). (1.6) 
Adding these two equations yields: 
Prob(A \ C) + Prob(B \ C) + Prob(C) = Prob(A) + Prob(B) — Prob(C). (1.7) 
Moreover, 
AUB=(A\C)U(B\C)UC, (1.8) 
and since A \ C, B \ C and C are disjoint, we have 
Prob(A U B) = Prob(A \ C) + Prob(B \ C) + Prob(C), (1.9) 
thus, 
Prob(A U B) = Prob(A) + Prob(B) — Prob(C). (1.10) 


Example 2.6 Let’s go back to the dice toss experiment with 


Q = {1, 2,3,4, 5, 6} 


and consider the events: 


(a) A = {“face smaller than 5”} = {1, 2, 3, 4}. 
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Since events {1}, {2}, {3}, and {4} are mutually exclusive, we know from axiom 
P3 that: 
4 
Prob(A) = Prob({1}) + Prob({2}) + Prob({3}) + Prob({4}) = z 
(b) B = {“even faces” } = {2, 4, 6}. 


Thus, A U B = {1, 2, 3, 4, 6} and A N B = {2, 4}. We also have, Prob(A) = z, 
Prob(B) = 2, Prob(A N B) = Prob({2, 4}) = 2, which implies 


4 3 2 5 
Prob(A U B) = Prob(A) + Prob(B) — Prob(A N B) = 6 + e 6e 


Next, we discuss events that may be considered as independent. To present this, we must 
first discuss the concept of conditional probability, i.e., the probability of an event occurring 
given that another event already happened. 


1.2.5 Conditional Probabilities 


Let A and B be any two events belonging to the same algebra of events. We suppose that B has 
occurred. We are interested in the probability of getting event A. To define it, we must look 
back to the construction of the algebras of events. 

Within the universe Q in which A and B are two well-defined events, if B has already 
happened, the elementary event associated with the result of this random experiment must be 
an element belonging to event B. This means that given B has already happened, the result of 
the experiment is an element of event B. 

Intuitively, the probability of A occurring is simply the probability that this result is also an 
event of B. If B has already happened, the probability of getting A knowing B is the probability 
of A N B divided by the probability of B. Therefore, we obtain 

Prob(A A B) 
Prob(A|B) = ———_—_.. (1.11) 
Prob(B) 
This definition of the conditional probability is called Bayes’ rule. 

This probability satisfies the set of axioms for probabilities introduced at the beginning of 

the section: 


Prob(Q2|B) = 1, (1.12) 
0 < Prob(A|B) < 1, (1.13) 
Prob(A‘|B) = 1 — Prob(A|B), (1.14) 
and 
CO 
Prob(U%., A,|B) = X Prob(A,,|B), ViAj, A;NA; =. (1.15) 
n=1 


This definition is illustrated next by way of examples. 
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Example 2.7 Consider the dice toss experiment with event 


A = {“face smaller than 5"} = {1, 2, 3, 4} 


and event 

B = {“even face"} = {2, 4, 6}. 
We know that 

Prob(B) = Prob({2, 4, 6}) = : 
and 


4 
Prob(A) = Prob({1, 2, 3, 4}) = z 
However, we want to know what is the probability of obtaining an even face knowing that 
the face is smaller than 5 (in other words, A has already happened). From Bayes’ rule: 
Prob(A N B) 

Prob(A) 

___ Prob({2, 4}) 
~~ Prob({1, 2, 3, 4}) 
_ 2/6 
~ 4/6 


Prob(B|A) = 


Example 2.8 From a population of N persons, we observe ns smokers and n people 
with cancer. From these ns smokers we observe ns < individuals suffering from cancer. For 
this population, we can say that the probability that a person is a smoker is 5+ and the 
probability that a person has cancer is 4. The probability that a person has cancer given 
that he is already a smoker is: 


Prob(smoker and cancer) Asc 
Prob(cancer|smoker) = = 


Prob(smoker) ony! 


From this experiment, we note that the conditional probability can be smaller or greater than 
the probability considered a priori. Following this definition of the conditional probability, we 
examine next the independence of two events. 


1.2.6 Independent Events 


Two events are said to be statistically independent when the occurrence of one of them doesn’t 
affect the probability of getting the other. A and B are said to be statistically independent if 


Prob(A|B) = Prob(A). (1.16) 
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From Bayes’ rule, if A and B are two independent events then 


Prob(A N B) = Prob(A)Prob(B). (1.17) 


Example 2.9 Consider the experiment of tossing a dice twice. Intuitively, we hope that 
the result of the first toss would be independent of the second one. From our preceding 
exposition, we can establish this independence as follows. Indeed, the universe of this 
experiment contains 36 simple events denoted by (R1, R2) where R1 and R2 are respectively 
the results of the first and second tosses, with (R1, R2) taking values (n, m) in 


Q = {1, 2,3, 4,5,6} x {1, 2, 3, 4, 5, 6}. 
The probability the first element R1 equals n is 


1 
Prob(R1 = n) = g Vn € {1, 2, 3,4, 5, 6} 
and the probability the second element R2 equals m is 


1 
Prob(R2 = m) = 6’ Vm € {1, 2,3, 4, 5, 6}. 


Since Prob(R1 = n, R2 =m) = % then the conditional probability 
Prob(R1l = n, R2 = m) 
Prob(R2 = m|R1 = n) = 
Prob(R1 = n) 
l 
BE EN 
Sees 
4 6 


which gives us Prob(R2 = m|R1 = n) = Prob(R2 = m) = i. Hence, we conclude that R2 
and R1 are independent. 


Notes and Complementary Readings 


The concepts presented in this chapter are fundamentals of the theory of probabilities. The 
reader could refer to the books written by Ross (2002 a and b) for example. 
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by Huu Tue Huynh, Van Son Lai and Issouf Soumaré 
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Introduction to Random Variables 


In the previous chapter, we introduced some concepts of events and defined probabilities on 
sets of events. In this chapter, we will focus on the representation of realized events on the 
real axis and probability space in order to provide a quantification to be used in financial 
problems. 

A random variable is a function mapping the sample space Q to the real axis. Afterwards, a 
complete characterization of such random variables will be given by introducing the probability 
density function, the cumulative distribution function and the characteristic function. We 
will show examples of the most frequently-encountered random variables in finance. The 
characteristic function will be presented in order to give the reader a better understanding of 
random variables. We will not use it extensively later in the book, but it is useful to be familiar 
with it to enable us to follow some proofs. 

We will also introduce the concept of transformation of random variables. This concept 
is the basis of random variables simulation under known distributions and will be used in 
subsequent chapters. 


2.1 RANDOM VARIABLES 


We have defined random events and the probabilities associated with these events. In finance, as 
in the sciences, random events are always associated with quantities such as indices, costs and 
interest rates which vary in a random way. This means that we could link these experiments’ 
random effects to the real axis. In other words, we associate a real number with the result 
given by the experiment. 

Before realizing the experiment, this number is not known — it behaves as a random result 
from a random experiment. This approach means that we are looking to create a random 
experiment on the real axis the results of which are what we will call a random variable. 

Mathematically, the random experiment on the real axis is created by using a function 
(denoted by X) from the universe of events Q on the real axis. The random results observed on 
the axis under this function are used as the basis to define the random events on the real axis. 
This representation on the real axis obeys the same rules or is subject to the same constraints 
as the original events. 

This function, or transformation X, must satisfy the following condition: let x be any real 
value, the set of all elementary events w such that {X(w) < x} is an event associated with the 
original random experiment 


A = {æ such that X (w) < x}. (2.1) 


In mathematical terms, this function is said to be measurable. 

Now, using this random variable, we only need to look at the universe of events as the real 
axis and the events as a subset of the real line. The most simple events are open or closed 
intervals and open or closed half axes. The constructed algebra of events based on these 
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natural events is known as the Borel Algebra of the real axis. One simple way to describe 
the Borel Algebra of the real axis is to construct events by combining the simple open and 
closed intervals and open and closed half axis of the real axis. Since Borel Algebra is not 
really necessary to follow the text, we will not dwell on it further. 


2.1.1 Cumulative Distribution Function 


Let X be a real-valued random variable, by definition its cumulative distribution function, 
noted F(x), is: 


Fy(x) = Prob(A) where A = {wsuch that X(w) < x}. (2.2) 
Based on the previous definition of probability, we deduce the following properties: 


(a) 0< Fxy@) <1, 
(b) Fx(x) is monotone, non decreasing, i.e., if x; < x2 then we have Fy(x1) < Fx(x2), 
(c) Fy(—oo) = 0 and Fy(+00) = 1. 


Properties (a), (b) and (c) follow from the probability axioms. When Fx(x) is continuous, X is 
said to be a continuous random variable. When it is the case, it can take any value on the real 
axis as the result of the experiment. However, when F(x) is a step function, X is said to be a 
discrete random variable. X may be a combination of continuous and discrete segments. 


2.1.2 Probability Density Function 


To keep it simple, consider X to be a continuous random variable. By analogy with the physical 
world, we can define its probability density function such that the integral of such a function 
on the event defined on the real axis gives the probability of this event. 

This density function can be obtained from the cumulative distribution function when 
looking at an infinitely small event. To see that, consider the event 


A={x < X <x+Ax}. (2.3) 
On the one hand, the cumulative function gives 
Prob(A) = Fx(x + Ax) — Fx(x), (2.4) 


and on the other hand we have 


x+Ax 
Prob(A) = i fx(a)da, (2.5) 


where fy is the probability density function of the random variable X. 
On the infinitesimal interval [x, x + Ax], since fx(.) is continuous, it remains constant so 
that we have: 


fx(x)Ax © Prob(A) = Fy(x + Ax) — Fy(x), (2.6) 
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where 
Fy(x + Ax) — Fy(x) 
Ax i 


When Ax becomes infinitesimal, we see that the probability density function is the derivative 
of the cumulative distribution function: 


fx) © (2.7) 


dFx(x) 


=y (2.8) 


Property 1.1 Function fx(x) is non negative. Since integrating the density function on an event 
gives us the probability of the event, if it were negative on a particular interval, integrating on 
this interval would give us a negative probability. This would violate our probability axioms. 
This property can be proved easily since the probability density function is the derivative of the 
cumulative distribution function. This cumulative function being a non decreasing function, 
its derivative can never be negative. 


Property 1.2 


Fy(x) = / i fx(a@)da <1, (2.9) 
Fy(—o0) =0 and Fy(+o0)=1, (2.10) 
which leads to 
fx) T 0 (2.11) 
and 
+00 
fx(a)da = 1. (2.12) 


For a discrete random variable X, since X takes values in a finite (or countably-infinite) set, 
we prefer to use the term probability mass function. The probability mass is the probability 
that X takes a precise value in this finite or countably-infinite set: 


Prob(X = k). (2.13) 


We present below examples of widely used random variables. 


Example 1.3 We toss a dice and the random variable is defined by the face which is 
shown on the dice. We define the random variable X taking the values 1, 2, 3, 4, 5, and 6. 
We have 


1 
Prob(X =k) =z, Vk € {1,2,3, 4,5,6). (2.14) 
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Example 1.4 We discussed above the number of calls received at a telephone exchange. 
Let X be this random variable. Then X can be 0, 1, 2, ..., 00. This phenomenon follows a 
distribution known as the Poisson distribution and its probability density function is defined 
by 

k 


À 
Prob(X = k) = Er (2.15) 
with k = 0, 1, 2,..., oo, where A is a positive constant depicting the average number of 


calls observed. 

This distribution is often used in finance in credit risk modeling, especially to describe 
credit default. In that case X can be the number of defaults in a given period and å is the 
average number of defaults. 


Example 1.5 The most common probability density functions are 


(i) The Gaussian normal distribution having the probability density function 


1 ai) 
fx@) = ie exp ( 72 ) , (2.16) 


where u and o are constants, o being positive. We show in the next section that u 
and ø are respectively the mean and standard deviation of the random variable X. 
This distribution is often used in finance to represent asset returns. It is also a key 
distribution in statistical inference. Figure 2.1 plots the probability density function 
for a variable following a normal distribution. 


0.4 
0.35 + 
0.3 + 
0.25; 


-4 2 0 2 4 
Figure 2.1 Gaussian density function 
(ii) The exponential density: 
fxQx)=ae, Yx € [0, +00], (2.17) 


where a is a positive constant. Figure 2.2 plots the probability density function for a 
variable following an exponential distribution. 
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0 1 2 3 4 5 


Figure 2.2 Exponential density function 


(iii) The uniform density: 


1 
fora<x<b 


fx@)=45-a l (2.18) 
(0) elsewhere 


where a and b are constants with b > a. Figure 2.3 plots the probability density 
function for a variable following a uniform distribution. 


1.5 


0.5 


0 0.2 0.4 0.6 0.8 1 


Figure 2.3 Uniform density function 


Remark 1.6 Using the Dirac function ô defined by 


d(x) =0 x £0; 
+00 
i d(x)dx = 1 otherwise, Pe) 
—0o 


we can introduce the density function. For the purpose of this book, we prefer to straightfor- 
wardly introduce the probability masses fdx denoted by dF. 
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2.1.3 Mean, Variance and Higher Moments of a Random Variable 


Physically, when we want to determine the average value of a random experiment, we repeat 
the experiment N times and we note the results x;,i € {1, 2, ..., N}. 

Suppose that among the N experiments, x; appears n; times. The average value of the 
results is: 


Z=): (=) xi. (2.20) 


i=1 


ni 


Intuitively, the quantity 4 measures the probability of obtaining the result x;. In this case, the 
experimental or empirical average is 


N 
X= X Prob (X = xi). (2.21) 
i=1 


nj 


When N grows indefinitely, if X is a continuous random variable, the ratio 7 


infinitesimal quantity represented by fy(x;)dx at point x;. 
This quantity with its physical interpretation is called statistical average and will be denoted 


goes to an 


+00 


my = E [X] = / xfx(x)dx. (2.22) 


(oe) 


The notation E[.] stands for the average value operator, commonly called mathematical 
expectation. This operator associated to an integral is linear, i.e., 


E [aX +bY]=aE [X]+bE[Y], (2.23) 


where a and b are two deterministic constants and X and Y are any two random variables. 

If we now look at another random variable Y linked to the random variable X by the 
transformation y = g(x), the average value of this new random variable Y is computed in 
making a large number N of experiments and denoting the results by x;. Hence, the average 
value for Y is 


PE, N Nn; N Nn; 
Y=) y = Y g(a). (2.24) 


The relative frequency X, when N goes to infinity, corresponds to the probability of getting 
the result x;. Thus we have: 


+00 
my = E [Y] =] B(x) fxQr)dx, (2.25) 
in particular when Y = X°, 
+00 
E [Y] =E[|x?°]= l x? fxœ)dx (2.26) 


is by definition the second order moment of the random variable X. 
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Figure 2.4 Three curves for the normal distribution with mean y = 0 and different ø 


We can center the variable X around its average value by: 
Y = X — E[X], (2.27) 
in this case E[Y] = 0 and 
E [Y?] = E [X - E [X}?] = E [X?] - E [XF . (2.28) 
The second order moment of the centered variable Y is by definition the variance of the 


random variable X, frequently referred to as o2. Meanwhile, og = Ja is called the standard 
deviation of X. 

Physically, ox measures the dispersion of the experiment’s results around its average value. 
Figure 2.4 plots the probability density function of the Gaussian distribution for three different 
values of o y. When oy is small, the graph of the probability density function of X is a curve 
concentrated around its mean. When ox is large, this curve flattens and gets wider. 

By the same means, we can introduce the moments of any order n: 


+00 
x= E[|x"]= f x" fx(x)dx (2.29) 
—oo 
and the order n centered moments 
+00 
Cy = E[(X —mx)"] = | (x — mx)" fx(x)dx. (2.30) 
=00 


Example 1.7 Let X be a Gaussian random variable with probability density function 


1 Q 
x)= e x2 , with o>Q0. 2.31 
fx) Was (2.31) 


The mean value for X is: 


+00 1 +00 ü- 
my = E [X] = f xfx(x)dx = / xe 2? dx. (2.32) 
4 -00 fx NV 210 —0o 
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To compute this value, we change our variable such that: u = x — u, which implies 


1 TeS u2 
E[X]= —— | (u + u)e 27 du 

2m0 J- 
1 S 2 1 ee 2 

= — ue %27 du + f ne 27? du. (2.33) 

J 2x0 J. V270 Joo 
To evaluate these integrals, we define the Gamma function: 
+00 
rœ) = f z™le™dz, Wx >0. (2.34) 
0 


We can easily show that 
r(x +1)=xr(x), 
r(k+1)=k!, 


and 


The Gamma function is widely used in statistics. With the properties of the Gamma function, 
the previous integrals can be computed by letting: ae = v, udu = o7 dv, du= zi, This 
way, the first integral is zero and the second is equal to: 


2u Foo u? 
Tna Í e 7 du =u. (2.35) 


It follows my = E[X] = u. 
Now we want to evaluate 


? = E[(X - mx] / = wee a (2.36) 
Oy = —m = x= € 2625: E : 
E E =Ó j 2mo 


By the same changes of variables, we get 


oy = E[(X —my)’] 
+00 2 
= > | ue 2%? du 
J2r0 Jo 


= eae [- vi” 
J2n0/2 0 


202 +00 


=o’. (2.37) 
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The Gaussian random variable is completely characterized by its mean my and its variance 
o? since the higher moments can be expressed as a function of these first two moments. 
This is shown in the next section. Thus, the probability density function of a Gaussian 
variable is 
_ emy)? 
2 


1 
= — e *% , 2.38 
fx@) nee e (2.38) 


Example 1.8 Let X be a Poisson random variable with parameter à. It means that X takes 
non negative integer values with probability: 
vk 
Prob(X = k) = ae Vk =0,1,2,...,00. (2.39) 
Using the physical definition already introduced, the mean value of X is defined by the 
expression: 


E[X] =) kProb(X =k) = Soke *—. (2.40) 


k=0 k=0 
To compute this value, we use the following simple and useful technique. Let 


(oe) 


h(x) = Xant, (2.41) 
k=0 
in their convergence domain, we get: 
' dh) _ + a_l k 
ra=- = X kax! = X kapx*, (2.42) 
k=0 k=0 
thus 
[o0] 
X kax! = xh'(x). (2.43) 
k=0 
Using the same calculations we obtain: 
[o6] 
2 k d / 
Xi Ka,x* = x —(xh'(x)). (2.44) 
dx 


k=0 


Now we apply this technique to the Poisson distribution. From 


CO ak ‘ 
SS ane (2.45) 
k=0 
one gets 
Sana d, i 
2i = Az (et) = he’. (2.46) 
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Then 
my = E [X] = eie =A. (2.47) 


The second order moment becomes: 


CO 
E[xX*]= 5 k?Prob(X = k) 
k=0 


00 
-paat 
k=0 

= Ae) 

=e") [aet + e] 

=AA+ 1D. (2.48) 
Which implies 

oy = E[X*]-—my 
=AA+1)—-2 


=). (2.49) 


It is interesting to note that the mean and the variance of a Poisson variable are equal. 


In the following two examples, we will study the binomial variable derived from an intuitive 
approach. 


Example 1.9 LetX be a binary variable such that: 
Prob(X = 1)= p and Prob(X =0)=1-— p. (2.50) 
Then its mean is 
mx=pxl+(1-p)x0=p (2.51) 
and its second order moment is 
M? =px? +(-p) x° =p. (2.52) 
Its variance becomes: 
og = M% — (mx? 
=p-p 
= p(l- p). (2.53) 
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Example 1.10 Y is a binomial random variable defined by 
N 
Y= 5 X;, (2.54) 
i=l 


where X; are independent binary variables. It follows that Y takes its values in the finite set 
{0, 1, ...,N} with probability: 


Prob(Y = k) = (7) a — py. (2.55) 
Its mean is: 
N 
= )>kProb(Y = k) 
k=0 
N 
=X k) pa = py 
k=0 
= Np. (2.56) 
Similarly, its second order moment is: 
M? = E[Y?] 


N 
= XK Prob = k) 


k=0 
N 
=) e)a- p~ 
k=0 
= Np + (N — 1)p). (2.57) 


Hence, its variance is: 
on 2 2 
oy = E[Y*]—my 
= M? — my 


= Np(1 — p). (2.58) 


2.1.4 Characteristic Function of a Random Variable 


Let X be a random variable with density function f(x). The characteristic function of X is, by 
definition, the Fourier transform of its probability density function, i.e.: 


+00 
x(w) = E [e"*] = f: e ™* Fy(x)dx. (2.59) 
—0o 
The characteristic function is an important analytical tool which enables us to analyze the 
sum of independent random variables. Moreover, this function contains all information specific 
to the random variable X. 
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Taking the Taylor series of the characteristic function, we get: 


+o | © (7, vk 
ox(uy = f È Cw opto (2.60) 
œ Lk=0 ; 


and switching the integral with the summation, we have: 


+00 |. k +00 +00 ,. k 
x(w) = cw) x" fe(ajdx =Y Cw) M% . (2.61) 
k=0 7 TOR. k=0 j 


This characteristic function contains all the moments of X, so we can easily deduce that: 


eau (2.62) 


1 d‘ox(w) 
Mi = E[x*] = e] 
x | l w=0 


Example 2.11 From a Gaussian random variable with the probability density function 


2 


eir, (2.63) 


1 
fx) = R 


we can prove that its characteristic function is: 


a2 
px(w) = e77", (2.64) 
Developing in power series the characteristic function, we obtain: 
oo ak Ik oo 2k ou NOE 
o (—w°) o (2k)! (iw) 
ox(w) = >> (5) ——— =>) ( ) l (2.65) 
= 2 k! ss f2. k! (2k)! 


By doing a term by term identification, we observe that the odd order moments are all equal 
to zero and the even order moments are given by: 


x OO! (0?) 
My = a (Z). (2.66) 


Example 2.12 For the case of a Poisson random variable with parameter A, the probability 
mass is: 
k 


À 
Prob(X =k) =e", k=0,1,2,.... (2.67) 
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Its characteristic function is: 


x(w) = E [e”*] 

+00 

X eit Prob(X = k) 
k=0 


+00 Ns 
—À iw yk 
ae Daa 
k=0 


=¢ re". (2.68) 


Using the Taylor series of the first and second order, we find the mean and the variance as 
follows. From the expression, 


d ii fi 
ao) =e- = ge") = meer ee, (2.69) 
dw 
we obtain 
ld 
my = M} = E | =a (2.70) 
i dw w=0 
And from the expression 
d? PA da . 
x(w) = ere he aie) T aet) tere Aei” BOALA) 
dw? dw? 
= ee" n + ire} , 
we find 
1 exw) 
M, = |= =) +à. 2.71 
4 E dw? Juno T oi) 


This result is identical to the one we have already obtained. 


2.2 RANDOM VECTORS 


So far, the concepts we introduced on random variables can be generalized to the multi- 
dimensional case. For simplicity, we focus on random vectors of two dimensions; formulas 
for the case of n dimensions can be obtained using a similar procedure. 

We built a random variable by making an application of the random experiment to the real 
axis using a mapping from universe Q (set of all possible events) on the real axis. This mapping 
defines a random variable if it is measurable. This allows us to construct the algebra of all 
possible events on the real axis. 

Now, if we look at a mapping of universe Q into the Euclidian space R”, and if this mapping 
is measurable, we will be able to define a random vector of dimension n. 
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This measurable mapping enables us to transform events in Q into events in R”. That way, 
instead of studying probabilities in Q, we study the probabilities of events defined in R”. As 
we have already discussed in the one dimension case, events in R” are simply the Borel events 
of the R” space. 


2.2.1 Cumulative Distribution Function of a Random Vector 


Let X be a two-dimensional random vector: X = (X1, X2). The joint cumulative distribution 
function of the random variables X; and X>, also called the cumulative distribution function 
of the random vector X, is given by: 


Fx,,x (x1, X2) = Prob({X; <x; and Xz <x). (2.72) 
This probability is equal to the probability of event A with 
A={w suchthat X\(w)<x,; and X2(w) < x2}. (2.73) 
To simplify the notation, we often use F’y (x) instead of 
Fy, ,x,(%1, x2). (2.74) 


Directly from the probability axioms, we find that this joint cumulative distribution function 
has the following properties: 


1. 0 < Fx, x %1, x2) < 1, 

2. Fx,.x,(+00, +00) = 1, 

3. Fy, x (%1, —oo) = 0 and Fx, x, (—00, x2) = 0, 
4. if xı < x| and x2 < x} then 


Fy, x,(%1, X5) + Fx, .x (1, x2) 


— Fy, x (x1, x4) — Fx,,x, (X1, x2) = 0. 


2.2.2 Probability Density Function of a Random Vector 


As with the case of a single random variable, in the vector case, we can build a function called 
the joint probability density function of the variables X; and X2 such that when integrated on 
an event E of R? we obtain the probability of this event: 


probe) = | f fe x(n drida, (2.75) 
E 


The function fx, x, (x1, x2) is called probability density function because when integrated 
on an infinitesimal interval E = {xı < Xı < xı + dxı and x2 < X2 < x2 + dx2}, we have: 


Tf fx x:&1, X2)dxıdx2 = Fx, x (x1, X2)dxıdx2 
E 
= Prob({x; < X; < xı + dxı 


and x2 < X2 < x2 + dxX2}). (2.76) 
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This construction allows us to say that: 


3? Fy, xa (11, X2) 


; = eee 2.77 
fxi. XX1; X2) TE (2.77) 


The simplest way to obtain this relation is to consider the repartition function defined by: 


x2 x1 
Fx, x (X1, X2) = f / fx. x(u, v)dudv. (2.78) 
—0CO J —CO 


Computing the derivative of this expression on both sides, we find the required result. 
When the random variables X; and X3 are discrete, we prefer to use the term probability 
mass: 


Prob(X; =x; and X= x), (2.79) 


where x; and x2 are discrete values taken by X; and X2. 


2.2.3 Marginal Distribution of a Random Vector 


Let (X, Y) be a random vector having the joint cumulative distribution function Fx, y(x, y). If 
we are only interested in one of the two variables, for example X, we can define its cumulative 
distribution function Fy(x). This function is the marginal cumulative distribution function of 
X. We can deduce it from the joint cumulative distribution function Fy y(x, y). In fact 


Prob(X < x) =Prob(X <x and Y < +00). (2.80) 
This gives 
Fy (x) = Fx,y (x, +00). (2.81) 


The marginal probability density function of X is given by the derivative of the marginal 
cumulative distribution function: 


fx(%) = EE (2.82) 
dx 
Since 
x y 
Fy y(x,y)= f f fx y(u, v)dudv, (2.83) 
then 
x +00 
Fy(x) = Fry(s, +00) = f fx y(u, v)dudv. (2.84) 
Implying 
d 
fx&) = g re +00), (2.85) 
x 
from which 
+00 
fx) = f fxy(x, v)dv. (2.86) 
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For practical purposes, in order to eliminate a variable, all we need is to integrate the function 
from —oo to +00 with respect to the variable in question to obtain the marginal density. 


Example 2.1 Consider a Gaussian random vector with probability density function: 
1 
2m \/ 1— p?oxoy 


-1 (x—my i Q-my) 2p z: 
as gae aa axoy ©MM) 


fxyQ@,y)= 


xe 
(2.87) 


This function represents the joint probability density function of two Gaussian random 
variables X, Y. The parameters my, my, ox and oF represent the means and variances of the 
two random variables X and Y respectively. o is known as the linear correlation between 
them; we will come back to this concept of dependence later. 
The marginal probability density function of X is obtained as follows: 
+00 


fx@) = Sx.y@, y)dy. (2.88) 


—oo 
All we need is to integrate the joint probability density function from —oo to +00 with 
respect to Y. After integrating, we get the marginal density of X: 


1 (remy)? 
Fa) = e cer (2.89) 
Ox 


This means that the marginal density of X is Gaussian with mean my and variance of. 


2.2.4 Conditional Distribution of a Random Vector 
For this section, we need to remember that the conditional probability of events is: 


Prob(A A B) 
Prob(A|B) = “rob (2.90) 


More generally, if A represents the event A = {X < x}, then 
Prob(A|B) = Prob({X < x}|B) = Fx(x|B). (2.91) 


Hence, the cumulative distribution function of a random vector X conditional to event B is 
by definition the probability: 
Prob({X < x} A B) 
Prob(B) 


(2.92) 


This general definition allows us to construct the following distributions. For the case where 
B = {Y = y}, we will get an indetermination since if Y is continuous, we have Prob(Y = y) = 
0. However, this indetermination can be removed when considering the following event B: 


B = {y < Y < y +dy}. (2.93) 
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With this definition, the probability of B is infinitesimal. When dy tends toward 0, we find the 
result we were looking for. It means that 
Prob({X <x}N{y <Y < y+dy}) 


TARS Prob({y < Y < y+dy}) ` a 


Under the hypothesis that Y is continuous, the probability of the infinitesimal event is propor- 
tional to the infinitesimal interval: 


Prob({X < x} A {Y < y +dy}) — Prob(X < x} N {Y <y) 


Fy(x|B) = 
x(a Frody 
F , d a F ’ 
_ Fx, y +dy) — Fy y(x, y) (2.95) 
fy O)dy 
This gives us 
[ġFxræ ydy 2Pere.y) 
Fx(x|B) = = AOR 
The conditional probability density function becomes 
a 
a 0 ay hx, y) 
-|F = , 2.97 
fly) = FE) = 5 l 7G) R 
so 
Zp Fx ra, y) fxy@, y) 
fely) = = oe 
fro) fro) 


Thus, the law governing the conditional probabilities is similar to Bayes’ rule. 

For the case where the conditional probability density function fx(xly) is equal to X’s 
marginal density, fx(x), it means that Y’s realization doesn’t affect X’s statistics. Then we 
say that X and Y are two independent random variables. Thus, for two independent random 
variables, their joint probability density function is given by the product of their respective 
marginal densities: 


fxy&, y) = fx) fy). (2.99) 


Example 2.2 (a) Taking the Gaussian vector example (X, Y). The joint density is 
1 
fxy@, y) = ———_ 
21 y 1- p2ox oy 


-1 J @-myY | 0- 
20—=p2) 2 a 


2; 
J 2 
P- er EMO m) 


xe 
(2.100) 


The marginal density of Y is 


fro)=——e ”. (2.101) 
Y 
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It follows that X’s conditional density with respect to Y is 


fxy&, y) 
fro) 
1 


= park | 
J 21/1 — p2ox 


It is interesting to note that this conditional density corresponds to a Gaussian random 
variable’s probability density function with mean given by 


fx@ly) = 


5 
2po 7 
a—my— 722% (y—my)| 


(2.102) 


200 
mx + L Gaz) (2.103) 
Oy 


and variance 
(1 — p*) ox. (2.104) 


This means the average value of X conditional to Y depends on the value of Y. 
(b) Let’s now consider two independent random Gaussian variables X and Y with means 
mx and my respectively and variances ox and oF respectively, then the joint density is 


1 -pf eax", om? | 
fxy@, y) = -———e 2 "r 
2m ox oy 


fx) fro). (2.105) 


As we can note from this joint density function, the independence of the two random 
Gaussian variables corresponds to p = 0. 


2.2.5 Mean, Variance and Higher Moments of a Random Vector 


As in the case of the scalar random variable, we can define the mean and the variance of joint 
random vectors and the moments of higher order as: 


= = xfx(x)dx 


+00 +00 
= f I xfx. y(x, y)dxdy. (2.106) 


E[X’] = f - x? fx(x)dx 


CO 
+00 +00 
= f i x fx y(x, y)dxdy. (2.107) 
—oo —0oo 
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ox = E [X°] -m 
+œ p+ 
= f / (x — mx) fxy(x, y)dxdy. (2.108) 


We can also define the same quantities for Y. 
In a general fashion we define X’s nth order moment noted M% by: 


E [x"] ` D x” fx(x)dx 


aes +00 
= / f x” fx y(x, y)dxdy. (2.109) 
—00 —C 


Similarly, the joint moments of order n, m of X and Y are: 
My & E [x"Y"] 
+00 +00 
= i / x” y” fy v(x, y)dxdy; (2.110) 
—0o —oo 
and the centered order n, m moments 
Cey = E[(X —my)"(¥ — my)" 


+00 p+ 
= f / (x — mx)" (y — my)" fry, y)dxdy. (2.111) 


Among the centered moments, the most important one is C zy called the covariance between 
X and Y. The matrix 


ch! ch! 
A= | BEE gut | (2.112) 
Y,X YY 


is called the variance-covariance matrix of the (X, Y) vector. This matrix can be explicitly 
expressed by: 


a=e| (Fx) (x-mx¥-my)], (2.113) 


Y — my 


Appendix A provides an introduction to matrix algebra for those unfamiliar with it. The 
correlation coefficient between two variables X and Y is by definition: 


EKX-mx)¥ —-my)) Cyy 


pÊ = ; (2.114) 
OxOy OxOy 
We write X = (X4, ..., X„)! to be the n dimension random vector. Since the random vector 
X ’s dimension is n, the covariance matrix’s dimension is (n, n) and is defined: 
A=E|[XK-my)K-my)'], (2.115) 


with my = E [x] = (mx,, . ..,mx,)! the mean of vector X. 
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Example 2.3 For the Gaussian vector having as density function 


1 
fxy@, y= 
27/1 - pox oy 
= x—my)? y-my 2 2 r 
O | E ae mom} (2.116) 
the covariance between X and Y, E[(X — mx)(Y — my)], is defined by: 
1 +00 +00 
= ae (x —mx)(y — my) 
21 / 1— p?oxoy J k 
= (x-my)?_, -my 2p > 
ce | 2 T Sy oy Mx MY tok a (2.117) 
The evaluation of this expression gives: 
E(X —mx)Y —my)] = poxoy. (2.118) 


Thus, (X, Y)’s joint probability density parameter p is the correlation coefficient of X 
and Y. 
The Gaussian vector (X, Y)’s variance-covariance matrix is: 


2 
a=| Ox eal (2.119) 


Pox Oy Oy 


its determinant is: 


|A| = oxay7(1 — p°), (2.120) 
and its inverse is: 
aian! | ep pora (2.121) 
(1 — p?)ožo? | —Poxoy ož i i 


The joint density of variables X and Y can be written more compactly as: 


y 3[X—mx,¥—my A177" )] 
2x |A|? 
Generally, a n dimension random vector is said to be Gaussian if its joint probability density 
function takes the form: 


fxy&,y)= (2.122) 


fx (x) l F (e-m) a'am 


= ——_____e (2.123) 
7 Qr)? |A|? 


where X = (X], S COLA mzy =E [x] = (mx,, „my, ) and A is the variance- 
covariance matrix of X. 

This density contains the inverse of A, and is only defined if A is a matrix of full rank. 
If A is singular, it means that at least one component of X is a linear combination of the 
other components. 
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2.2.6 Characteristic Function of a Random Vector 


Let (X, Y) be a 2 dimensional random vector with probability density function fy y. The joint 
characteristic function of vector (X, Y) denoted by x, y (w1, w2) is defined by the Fourier 
transform in 2 dimensions of the vector (X, Y) and can be written as: 


+00 +00 , 
dxy(wiwa)= ff em fyt, yydxdy. (2.124) 
=œ J- 
It is very important to note that when X and Y are two independent random variables, we have 


fxxy&, y) = fx) fr). (2.125) 
Which implies: 


ox,y(w1, w2) = bx (wy dy (w2). (2.126) 


This relation can be generalized for the case of random vectors with dimension greater than 2 
as we show below. 

As in the case of a scalar, we can use the characteristic function to obtain all moments of 
all order of variables X and Y as follows 


1 a"t" dy y(wy, w2) 


BLE Garage ee 


(2.127) 


In the more general case of a n dimensional random vector X having probability density 
function fx (x), its characteristic function ¢y is: 


ox(w) = E[e'*] 


e 
T IX Riut KIN Xa) 
R” 


x efwinitwrrt..Awirn dy dx ...dXn. (2.128) 
In the case of independent vectors, we have: 


(I> 


ow=f ff Fx, 1) fx. 2) -- + fx, On) 


x ei win turret. +w) d y dx ...dXp 


= | facneman f fecare™an... 
R R 


x f fr,Gindel™dey 
R 


=|] f fx, er dxg. (2.129) 
k=1 "R 


Then, for all independent n dimensional random vector X, one has: 


ox(w) = | | ox. (wo. 
k=1 
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Example 2.4 Let X be a Gaussian random vector with density: 


1 ‘ ips, 
f(x) = ———e 22" ®) (2.130) 
7 (277)? |A|? 
where A is the variance-covariance matrix of X, with 
A= Ogre, j (2.131) 
and 
Àkj = E[(X; — mX; —mj)]. (2.132) 


Thus the joint characteristic function of X is: 


gy (w) = elt wew Aw, (2.133) 


2.3 TRANSFORMATION OF RANDOM VARIABLES 


Let X be a random variable with probability density function fy(x). Let’s consider the random 
variable Y obtained from the transformation Y = g(X). We would like to find the probability 
density function of fy(y) in terms of fx(x) and g(X). In order to achieve this, we look at the 
event 


A={y <Y <y+dy}. (2.134) 
If Y is a continuous random variable, one has: 
Prob(A) = fy(y)dy. (2.135) 
Let B be the inverse image of A, defined as B = g7! (A). It is given by: 
B= {(x; < X < xı + ldx) U (x2 < X < x2 + |dx|)U... 


UCtn < X < xn + ldxnl)}. (2.136) 
Thus we have 
Prob(B) = X` fx@x) ldx]. (2.137) 
k=1 


Since A and B must have the same probability, Prob(A) = Prob(B), one finds: 


Frody = D> fx (xe) ldx], (2.138) 


k=1 


which implies 


n d 
fr) = Yo few) | 
k=1 y 
=5 f) (2.139) 
dy 
k=1 | Jx = 
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xx’s being the solutions of y = g(x), x, = g7!), k = 1, 2, ..., n. This simple relation is very 
useful for the simulation of random variables on a computer. 


Example 3.1 Affine transformation of a uniform variable. 
Let 


a<x<b 


elsewhere (2.140) 


tel 
fx) = [= 


be the density function of a uniform random variable X. If we set y = ax + 6, then x = ve 
For this transformation, only one solution to the equation y = g(x) obtains or x = g7! (y) 


exists. The probability density function of variable Y is 


fx) 


dy 


dx 
_ fx) 
la| 


= E q&a + B < y = ab + B 
~ 10 elsewhere. 


fro) = 


(2.141) 


One can see that an affine transformation of a uniform random variable gives rise to 
another uniform random variable. 


Example 3.2 Affine transformation of a Gaussian variable 
Let X be a Gaussian random variable with probability density function 


1 _ amy)? 
fx&) = T e x , (2.142) 
TOX 


Let y = œx + 6, then x = ZE, The probability density function of Y is 


fx) 


dy 


dx 


_ fx) 
el 


fyro= 


1 Ni ar aL y 
= ———e x , (2.143) 
~v 2r |a| ox 


The affine transformation transforms a Gaussian random variable into a new Gaussian 
random variable whose mean and variance can be easily computed: 


E[Y]=my =amx+ß and of = g&o}. (2.144) 
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Example 3.3 Log-normal distribution 
Let a Gaussian random variable X have the probability density function 
o) =e 
fxQ@) = ETF e€ % 
The transformed variable Y from X given by the equation y = e* = g(x) has a unique 
solution x = log(y). The probability density function of Y is given by 


(2.145) 


fry) = BO 
dy 
dx 
_ fxQ@) _ fx(og(y)) 
Sa > 
1 1 _ dog(y)—m)? 
202 (2.146) 


= ——— e 
Y V2TO 
This probability density function of the log-normal law is very useful in finance and we 
will explore it later. 


Example 3.4 LetX be a random variable of uniform density: 


ei 
fxQ@) = [g 


O<x <2 


elsewhere. (2:147) 


Consider the modified variable Y from the variable X: y = sin(x). This transformation 
forces lyl < 1, which means that fy(y) = 0 for lyl > 1. The equation y = sin(x) has two 
solutions for 0<y < 1: 


xı = arcsin(y) and x= m — arcsin(y), (2.148) 
and for negative y, (—1 < y < 0), 
xı = m — arcsin(y) and x= 27 + arcsin(y). (2.149) 


The arcsin(y) function is a function taking values only in the interval [- oe z] . Thus for 0 
Syl, 


fx@) zn fx (2) 


FOST =n (2.150) 
dx | _ dx | _ 
si X=X2 
with 
dy 5 
el lcos(x)| = y1 — sin(x}? = y1 — y?. (2.151) 
x 
Since this derivative is independent of the points xı and x2, then 
a 1 
fro) = 2 + 2 
i vi-y» y1- 
1 
= ——_.. (2.152) 
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Example 3.5 Let X be a continuous random variable having probability density function 
fx (x) and cumulative distribution function Fy (x). Then 


d 
fx(®) = geo (2.153) 
x 


Let’s consider a new random variable Y defined by the transformation y = Fy(x). This 
transformation restricts y to the interval [0, 1]. Then the probability density function for Y 
is given by: 


(x) 
froy= = (2.154) 
dx 
and since 
dy dFy (x) 
“= = 2.1 
ag PP fx), (2.155) 
it follows: 
ib oagal 
fr) = | 0 elsewhere. (2.156) 


This formula is interesting since it can be used to generate random variables. 


Example 3.6 


(i) Let X be a random variable uniformly distributed between 0 and 1. We seek to determine 
the transformation Y = g(X), i.e., 


x=g 0) (2.157) 
such that the new random variable Y has the probability density function fy(y). From 
the last example’s result, we will study the mapping y = F yŒ) where Fy(y) is the 


distribution function of Y and F s 1 (x) is the inverse function of Fy. Thus, the probability 
density function of Y is given by: 


(x) 
pot a (2.158) 
dx 
Then we have 
3 dy _ dFy'(x) 
i — =, 2.159 
y= Fy @) = ce ay ( ) 
Finally, this density is equal to a with x = Fy(y) where 
dx 
ay = fro: (2.160) 
y 


(ii) Application: Let X be a random variable uniformly distributed on the interval [0, 1]. 
Find y = g(x) such that Y has probability density function we™® 1 1, > o}. The cumulative 
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distribution function of Y is 


y 
Fy(Qy) = / ae “dx = (1—e ™ )1ys0}. 
0 


1 
y= log(1 — x). 


a 


Then, y = Fy (x) with x = Fy(y) = (1 — e 14, > 0}. Finally, we have 


(2.161) 


(2.162) 


2.4 TRANSFORMATION OF RANDOM VECTORS 


Let (X, Y) be a random vector with density fy y(x, y). Consider the mappings V = g(x, y) and 


W = h(x, y). Let’s define the event 


A={(u < V <v+dv)N(w<W <w+dw)} (2.163) 
and B be the inverse image of A defined by: 
g=] i< X su tldapain <¥ < yi +idyip}..- (2.164) 
U.. U {On < X < Xn + ldx) N On < Y < yn + ldyn D j l 
Since A and B have the same probability of occurring, we conclude: 
fvw, wdvdw = È fx, Qe, Ye) \dxedye . (2.165) 
k 
This implies: 
fwo, w=) ferae Ye) — ; (2.166) 
k dxdy aa 
X=Xk,Y=Yk 
where one is the Jacobian of the coordinates’ mapping defined by: 
ðv ðv dg dg 
ox oy ox dy 
= = 2.1 
ðw ðw ðh oh ae 
ox oy ox dy 
Let the n dimensional random vector X = (X1, ..., X al have the density function fy (x). 
Denote Y as the new random vector obtained from X by the transformation 
Y = 9(X), (2.168) 
with 
Ye = ge(X) = 8(X1, X2,..-, Xn), k= 1,2,...,0. (2.169) 
Suppose that Y = g(X) has an inverse image composed of m vectors x 1 x?,..., X”, that is 
PS); i=1,2,...,m, (2.170) 


where the m vectors xi, i = 1,2,...,m, transform into the same vector y. 
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Let 7 be the Jacobian of the transformation defined by 


Oy, Oy, 
an Pe Saar 
I(x) = : l : , (2.171) 
Yn Yn 
Ox, Ixa 


fry) = are (2.172) 


Example 4.1 Affine transformation of a Gaussian vector 

Consider the Gaussian random vector X with mean my and covariance matrix Ax, de- 
noted by N(mx, Ax). Set ¥Y = UX + n, where & is a matrix with appropriate dimensions. 
Then the affine transformation changes the Gaussian vector X into a new Gaussian random 
vector Y with mean and variance-covariance matrix given by 


E[Y|]=my = Emx+u (2.173) 
and 


Ay = DAx='. (2.174) 


Example 4.2 
(i) Let (X, Y) be an independent Gaussian random vector with the probability density 
function: 


atte 
fxy(. y) = rar dor, 


(2.175) 


SetV = VX? +Y? and W = arctan({). If we restrict W to [0, 27], this transformation 
forces V to be non negative and the inverse image consists only of one element: 


X = V cos(W) and Y = V sin(W). (2.176) 
Then the density is 
Ox ax 
ðv ðw 
fvw, w) = fxy&,y)| J| where J = f (2.177) 
dy dy 
ðv ðw 


The determinant of J is 


cos(w) —v sin(w) 
sin(w) v cos(w) 


IJ| = =v. (2.178) 
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Which implies 


v2 
ey yao. (2.179) 


fvw, w) = To? 


This joint density of V and W is interesting since we recognize that: 


2n 
vo _ wv 
fv(v) = fvw, w)dw = -ia lw>0}: (2.180) 
0 


Which allows us to write: 


fvw, w) = fv) fw), (2.181) 
where 


0<w <2 


elsewhere (2.182) 


fw(w) = t 


This implies that V and W are two independent random variables. The random variable 
V is called a Rayleigh variable. 

(ii) Also from the random variables V and W, we could reconstruct the independent Gaus- 
sian random vector (X, Y) as follows: 


X = V cos(W ) and 
Y = V sin(W). (2.183) 


2.5 APPROXIMATION OF THE STANDARD NORMAL 
CUMULATIVE DISTRIBUTION FUNCTION 


In the rest of the book, we frequently use the Gaussian cumulative distribution function. This 
function is usually denoted by N(.). There is no exact formula to compute the Gaussian normal 
cumulative distribution function N(.). Tables of values of N(x) are available in most statistical 
or mathematical books. 

However, it is important to be able to compute values of N(x) in algorithms without having 
to refer to a statistics table. For this, we will give an approximation of the exact value of N(x) 
which is accurate up to six decimals. 


N(x) © 1—n(x)(ayk + ack? + azk? + agk* +ask°) if x > 0, (2.184) 
and 
N(x)=1—N(-x) ifx <0, (2.185) 
where 
1 
kanm, 
1+yx 
y = 0.2316419, 


a, = 0.319381530, 
a2 = —0.356563782, 
a3 = 1.781477937, 
—1.821255978, 
as = 1.330274429, 


a 
fN 
II 
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and the probability density function is 


n(x) = N’ 


| 
2 
m= 

a 
Ym 


1 
= eel? (2.186) 


For example, with the MATLAB® software, N(x) can be computed using the function: 
normcdf . 
DistNormCum=normecdef (x); 


Appendix B contains a list of commonly used functions in MATLAB. 


Notes and Complementary Readings 


For further readings on the topics covered in this chapter, the reader can consult the following 
books: Ross (2002 a and b), Hogg and Craig (1995), Wackerly, Mendenhall and Scheaffer 
(1996) among others. 
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Random Sequences 


The preceding chapter has enabled us to describe and characterize random variables. However, 
many real phenomena, and particularly those encountered in finance, can be modeled by a 
sequence of random variables. This means that we need an asymptotic characterization of 
these random sequences in order to deduce their properties. These properties will be useful in 
many cases as we shall see. 

Convergence and limit concepts studied in this chapter will be useful when we have to 
characterize the precision, convergence and control of simulation techniques presented later. 

To construct the probability of events, we resort to the notion of relative frequency obtained 
by realizing a large number of random experiments and counting the number of outcomes. 
This repetition of experiments implies the use of a large number of random variables. We will 
study a very important property in statistics theory that will be useful later. 


3.1 SUM OF INDEPENDENT RANDOM VARIABLES 


Definition 1.1 {X;},=1.2, ... oo is called a sequence of independent and identically distributed 
random variables if all the variables X, have the same probability density function and any 
arbitrary number of these variables form an independent random vector. 


Let {Xz }k=1,... œ be a sequence of independent and identically distributed random variables. 
One can construct the following series: 


Sn =D) Xe (3.1) 
k=1 
and 
M, = : yx (3.2) 
n= ñ a ke i 


M,, is the normalized sum of the X;,’s and is often called the arithmetic mean of these n 
variables. Their means are easy to calculate: 


E [S,]= E be x 
k=1 
=) Ei] 
k=1 


=nmy (3.3) 
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and 
1 
E [Mn] =-E [Si] 
n 
= My, 


where my is the shared mean of all the X; variables. 
The variance of S,, is 


Var [Sn] = E [S7] — E [Sn 
where 
E [S } = nm}. 


Then we need to compute E [S3] as follows: 


E[S] =E Soy 


k=1 j=1 
n 

=E Ba +E| OXX; 
k=1 k#j 


n 


E [Xz] +) E [XX]. 
k=1 kj 


Since X; and X; are independent when k Æj, we have: 


E[X,X j] = E[X,JE[X |], 


thus 
E[S] =) EK] + om 
k=1 k#j 
=nE [x°] + (n? — nym. 
Which implies 


Var [S,,] = nE [x°] + (n? — n)m?, — n?m? 
=n [E [x°] — my | 


2 
= Noy, 


(3.4) 


(3.5) 


(3.6) 


(3.7) 


(3.8) 


(3.9) 


(3.10) 


where o? is the variance of the random variables Xz. This result is well known since the 
variance of a sum of independent random variables is equal to the sum of their variances. 


To obtain the variance of M„, we note that M, = tSn, then 


1 
Var [M,] = zz Var [Sy] 
ox 
a 


(3.11) 
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3.2 LAW OF LARGE NUMBERS 


Previously, we have intuitively studied the relative frequency that a given event occurs. We 
expect that when the number of experiments becomes very large, this relative frequency should 
converge to the event’s probability. In reality, this relation is obtained from the law of large 
numbers regardless of the studied distribution. In fact, we have the weak and strong law of 
large numbers. The weak one is as follows. 

Let {Xz }z=1, ... oo be a sequence of independent and identically distributed random variables 
and M, = 1 X 4—1 Xx the arithmetic mean having finite mean and variance. We already found 


the mean and variance of M,,: E[M,,] = my and Var [M,,] = an 


Definition 2.1 Chebychev inequality: Let X be a random variable with expected value u and 
finite variance o°, then for any positive number k, we have 


2 
Prob (|X — u| > k) < a (3.12) 


or equivalently 


1 
Prob (|X — u| > ko) < oe (3.13) 


On the one hand, applying this above Chebychev’s inequality, we have: 


2 


Ox 
Prob (|M, — mx| > €) < = (3.14) 
nE 
or more precisely 
z 
Prob (|M, — mx| < ©) > 1- 5. (3.15) 
ne 


This inequality means that, for any £, we have: 
lim Prob (|M, —my| < €)= 1. (3.16) 
n— Cc 


This is exactly the formulation of the weak law of large numbers. It tells us that when the 
number of experiments tends toward infinity, the empirical mean converges in probability to 
the mathematical expectation. 

The fact that this empirical mean converges to its expected value regardless of the distribution 
law of the X;’s shows us that our intuition about the relative frequency was right. This result 
allows us to compute the expected value of the mathematical expectation using the empirical 
mean estimated on a very large number of experiments. 

On the other hand, the second formulation of the law of large numbers is more complicated 
to prove but confirms that it is proper to use the mean in more complex situations. The strong 
law of large numbers states that 


Prob ( lim M, = mx) =i (3.17) 
n— Oo 
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We refer to the Chebychev’s inequality to gauge the mathematical expectation estimate pre- 
cision when using the arithmetic mean. Effectively, we can rewrite this relation as: 
1 
Prob (|M,, — mx| < €ox) > l= (3.18) 
ne 
é is called the relative precision and | — is called the confidence interval. 
Then, to get a precision of 1% with a 95% confidence, we must choose n such that: 
1 1 


1 =1 ~ 0.95 3.19 
ne? n(0.01)2 el) 


or 
1 
n rey 
0.05(0.01)? 


This number seems relatively high with respect to the desired precision. This phenomenon 
is caused by the fact that the Chebychev’s inequality has been established for all laws of 
probability, which means that the bound is relatively large. 

In the following section, we will look at an approach that allows us to decrease the required 
number of simulations. 


= 200000. (3.20) 


3.3 CENTRAL LIMIT THEOREM 


The physical observations due to the superposition of a large number of effects can be char- 
acterized as having a distribution with a bell shape (Gaussian). This theorem has been well 
established since the beginning of the 20th century. 


PEETS 


1 n 
= Tae (> x) = nm , (3.21) 
k=1 


it immediately follows that the mean of Z, is zero and its variance is one: 


E[Z,]=0 and Var[Z,]= 1. (3.22) 


Theorem 3.1 The central limit theorem states that when n approaches infinity, this random 
variable Z, converges towards a Gaussian distribution with zero mean and unit variance. 


Proof: The proof of this theorem uses the characteristic function. An heuristic approach of 
the proof is as follows. 
Let 


x(w) = E [e'”*] (3.23) 
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and 
bz, (w) =E [ere] 
ae [eee (3.24) 


Since the random variables X; are independent, it follows: 


$z, (w) = E [e m( “| es [e] 
= [e je" N . (3.25) 


The Taylor series development gives: 


2 


iw( 2) _ Ww w 2 
Ele nox =Eļ|1+i (X — mx) zX mx) 
/nox 2nox 


+0 (—) |: (3.26) 


Finally, 


; 5 2n n/n i ` 
W hen n approaches infinity, 


w 


2\” ; 
$z, (w) > (1 e =) xe, (3.28) 
2n 


This result allows us to deduce that Z, is a Gaussian random variable with zero mean and 
unit variance. 

This theorem allows us to improve the number of experiments n in the estimation of the 
mathematical expectation by the arithmetic mean. To achieve this, we use the probabilistic 
inequality: 


Prob (a < Zn < b) > (3.29) 


l b 
—— e 
~v 2T 1 
In practice when n gets larger than 30, we could consider Z, to be Gaussian. 
Indeed, we want to compute the necessary number of experiments to obtain e = 1% as 
relatively precise with a 1 — æ = 95% confidence level. We know that 


1 
M, —mx =- Xr- 
mx ian mx) 


= ae Doa -= my), (3.30) 
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thus 


Zp. (3.31) 


From Chebychev’s inequality, we have 


1 
Prob(|M,, — my| < €ox) > l- — = l-a, (3.32) 
ne 
and 
Prob (-e0x < een < cox = Prob (—e/n < Z, < €/n) 
1 eyn zd 
Den e ?dz 
Vr —e/n 
2/2 evi e 
ee e A (3.33) 
~V 2m Jo 
Thus 
n 
Prob (— Zn <— = erf = 
rob (—En/n < Zn < —€4/n) ert (e7) 
=1l-a, (3.34) 
where 
2/2 f 2 2 fe xe 
erf(x) = == | e z7du= — | e 7du, (3.35) 
~v 2r Jo VT Jo 
which implies 
2 : 2 
aa [erfa -o| (3.36) 


3.4 CONVERGENCE OF SEQUENCES OF RANDOM VARIABLES 


Until now we have studied a sequence of independent random variables having exactly the 
same distribution. In this context, we looked at a particular variable corresponding to M,. If we 
study the sequence of these random variables M„, we see that they are no longer independent 
and do not have the same distribution. However, we see that they converge in a particular 
direction given by the laws of large numbers and the central limit theorem. It is therefore 
important to introduce other convergence concepts pertaining to random variables. 


3.4.1 Sure Convergence 


Remember that a random variable is a function resulting from an application of the set of 
events (universe Q) into a set of the real axis. Consider the sequence of random variables 
defined on the same space of events Q. 
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For an elementary event w of Q, X,,(w) is a deterministic numerical sequence. Suppose that 
this sequence converges to a value X(w) when n goes to infinity, 


dim Xn(w) = X(w), (3.37) 


X can be random or not. 

The sequence X, is said to surely converge if the sequence X,(w) converges for all w in Q. 
We can verify this convergence using the Cauchy criterion for all elements w in Q. Intuitively, 
the Cauchy criterion states that when an infinite sequence converges, it converges to a finite 
limit. 


3.4.2 Almost Sure Convergence 


Generally, sure convergence is hard to obtain. It is possible that the sequence X,,(w) converges 
only for certain values w belonging to a subset A; this means that for all w belonging to subset 
A, X,(w) converges to X(w). 

We say that the sequence of random variables X, almost surely converges to X if Prob(A) = 1. 
In this situation, there exists a subset for which the sequence X,, does not converge. However, 
the probability of this event is zero, and thus physically not interesting. We can verify an 
almost sure convergence with the Cauchy criterion for all sequences X,,(w) with w belonging 
to A. 


3.4.3 Convergence in Probability 


As we saw in the case of the weak law of large numbers, we say that a sequence of random 
variables converges in probability, if there exists a random variable X such that: 


lim Prob (|X, — X| > €)=0, Ve > 0. (3.38) 
n— co 


3.4.4 Convergence in Quadratic Mean 


This mode of convergence concerns only the second order moments. We say that a sequence of 
random variables converges to X in quadratic mean or in mean square if there exists a random 
variable X of finite second order moment such that: 


lim E[|X, — X|] = 0. (3.39) 
n— CoO 


This concept of convergence in mean square can be generalized to any order. In the case of 
the first order moment, we say that the sequence of random variables X,, strongly converges or 
converges in mean to X if: 


lim E[|X, —X|]=0. (3.40) 


n— oo 


However, sometimes we are only interested by the distribution of the random variables X,. 
In this case we say that the sequence X,, converges in distribution to a random variable X when: 


lim Fy, (x) = Fy(x). (3.41) 


In this case the moments of X„ converge to the moments of X. 
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If we are interested in a function of X,, the sequence of random variables X,, weakly 
converges to X if 


lim E l(a) = E [BO]. (3.42) 


These two modes of convergence, strong and weak, are very important in Monte Carlo 
simulations. Indeed, we can use the strong or weak mode following the objective of the 
financial problem studied. For example, if we are interested in the average value of a financial 
asset at a given date, we could replace a complex distribution f, by another distribution f 
providing that X converges to the desired value with respect to g(X). 


Notes and Complementary Readings 


Further readings can be found in the following books: Ross (2002 a and b), Hogg and Craig 
(1995), Wackerly, Mendenhall and Scheaffer (1996). 
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Introduction to Computer Simulation 


of Random Variables 


In finance, as in many other areas of social and human sciences, it is often very difficult or even 
impossible to find an analytical solution for most of the problems we have to solve. This is the 
case with the evaluation of complex and interdependent financial products. However, there are 
many numerical techniques available to evaluate these products. One of the techniques that 
we present in this book is the computer based simulation method. From its inherent nature, 
this method offers a wide flexibility, especially when evaluating financial products dependent 
on many state variables. 

How can a computer generate random events, since if it executes a given algorithm knowing 
the initial result, we should be able to guess the results at the outset? In fact, results generated 
on a computer are not conceptually random but, from the point of view of an observer, behave 
like random ones. 

These results occur with a statistical regularity as discussed in the previous chapters. The 
regularity associated with the results generated by a computer can be explained as follows: if 
the algorithm generated by a computer is chaotic then it will produce unpredictable results for 
an observer. Thus, to build a dynamically chaotic algorithm is sufficient to generate random 
events. 

The previous chapters showed how to obtain random variables for which we know the 
distribution function from a uniform random variable. The problem of generating random 
variables on computer is the same as generating a uniform random variable. 

In order to generate a uniform random variable without resorting to a complex chaotic 
dynamic, all we need to do is to build an algorithm such that the generated result has the 
statistical regularity of a uniform random variable. 

If the generated number is expressed with a binary representation, the number of 0 and 1 
should be approximately equal, the number of two following 0 and two following 1 should be 
approximately equal, etc. 

In the last 50 years, we have seen researchers focussing on building algorithms that yield this 
statistical regularity. Because a computer has a finite precision, the generator can’t generate 
an infinite number of different outcomes but only a finite number of results. This means that 
the series of generated results is periodic. Generally, we choose the algorithm that gives us the 
longest possible period. 

In this chapter, we define basic concepts regarding the generation of random variables on 
computers and we present the techniques used to generate some widely used random variables 
in finance. After that, we present the simulation of random vectors using variance-covariance 
matrix decomposition techniques such as the Cholesky decomposition and the eigenvalue 
decomposition. The financial assets variance-covariance matrix decomposition is important in 
the study of risk factors, and more precisely in the pricing of financial instruments and risk 
management. Finally, we introduce the acceptance-rejection and Monte Carlo Markov Chain 
(MCMC) generation methods that could both be used in many simulation cases in finance. 
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In this chapter, as well as in the followings, we include MATLAB® programs in order to 
illustrate the theory presented. A list of most frequently used functions in MATLAB is given 
in Appendix B. 


4.1 UNIFORM RANDOM VARIABLE GENERATOR 


As discussed above, the series of numbers generated by the computer must have the longest 
possible period. The most frequently used algorithm is of the form: 


Xn41 = aX, +b (modm). (4.1) 


The series of results obtained from an initial value corresponds to the realization of a sequence 
of uniform random variables on the set 


{0,1,2,...,m— 1}. (4.2) 


In reality, in order for the period of the series generated with this algorithm to be m, we 
must choose integer a, b and m in an appropriate way, because this period is equal to m if and 
only if 


(i) bis relatively prime with m, 
(ii) a — 1 is a multiple of any prime number that divides m, for example a multiple of 3 if m 
is a multiple of 3. 


The parameters used by MATLAB’s generator are described in Moler (1995) and given by: 


m = 27! — 1 = 2147483647, 
b=0, and 
a = T = 16807. 


This algorithm generates a series of integer numbers between 0 and m — 1. If we divide the 
result by m, we get results in the interval [0, 1]. Since m is very large, the difference between 
2 successive numbers is relatively small and thus we can consider this discrete variable to be 
a continuous variable in [0, 1]. 

In MATLAB, to generate a uniform random variable between 0 and 1, often denoted by 
U(0, 1), we use the command rand. The command rand generates one random variable 
uniformly distributed on the interval [0, 1]. The command rand (m, n) generates m x n 
random variables uniformly distributed on [0, 1] and stocked in a matrix of m rows and n 
columns. 

From the examples presented in previous chapters, a uniformly distributed random variable 
on the set [a, b] called U (a, b) can be generated from the variable U (0, 1) using the relation 


U (a,b) = (b —a)U (0, 1)+ a. (4.3) 


4.2 GENERATING DISCRETE RANDOM VARIABLES 
4.2.1 Finite Discrete Random Variables 


Let the random variable X on the finite space of events {x1, x2, ... , Xn} have the respective 
probabilities p1, p2,..., Pn. We can simulate X with a uniform random variable U(0, 1) in 
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setting: 
X=x, if U<p., (4.4) 
X=x if pı<U <p +p, (4.5) 
X =Xp if Pit Pp2+':: + Pr- < U < pi + p++ Pn. (4.6) 


This method simply means that we divide the interval [0, 1] in n sub-intervals with lengths 
corresponding to p1, p2,..., Pn. In generating a uniform random variable, this subdivision 
uniquely defines X. 


Example 2.1 We want to generate the daily price fluctuation of a financial asset that can 
have an increase of A; with probability pı, decrease of — A> with probability p2, or stays 
the same with probability p = 1 — pı — p2 (each p; should be positive and pı + p2 + p3 
must equal 1). That is, 


Prob(X = Aj) = pı, 
X e {—4,3, 0, Ai} with { Prob(X = 0) = 1 — pı — p2 = ps, (4.7) 
Prob(X = —A>) = po. 
To obtain the realization of X, we generate a uniform random variable U(0, 1) and X is 
given by: 


U<pi = X=Ai, 
p<USptp = X=-A, (4.8) 
Pit p <U => X=0. 


In MATLAB, the program is: 


function X=GenerateVariable (pl, p2, Deltal, Delta2) 
%X takes value Deltal with probability p1 
%X takes value -Delta2 with probability p2 
%X takes value 0 with probability 1-pl1-p2 


U = rand; Uniform random variable 
X=Deltal * (U<=pl1)-Delta2 * (p1<U) * (pl+p2 >= U); 


end 


Example 2.2 Let a binomial variable X = B(N, p) that is a variable taking value in {0, 1, 
2,...,N} with respective probabilities: 


pk = Prob(X = k) = (Y) pa- p=, k=0,1,...,N. (4.9) 


This variable is the sum of N independent binary random variables B; such that 


X = Bi + B2 +--+ By, (4.10) 
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with Prob(B; = 0) = 1 — p and Prob(B; = 1) = p. When N is relatively small, for example 
smaller than 30, we can generate B(N, p) by generating N binary variables B;, each one 
being generated from a uniform random variable U(0, 1) as follows in MATLAB: 


function B=Binary (p) 
$B takes value 1 with probability p 
$B takes value 0 with probability 1-p 


U = rand; 
B= (U<=p) ; 


end 


To generate the random variable X, we can use the recursive form: 


function X=Binomial (NbBinary, p) 
sFunction that simulates the Binomial variable X 


X=0; 

for n=1:NbBinary 
X=X+ (rand<=p) ; 

end 


end 


In MATLAB, using the loop command for... end means that the calculations are per- 
formed many times, that is, it generates the random variables one after another and sums 
them as they are generated. 

When the number of random variables N is small, the computation time of the loop is 
not a big problem. On the other hand, when N is large, computation time of the loop is 
very large and not so efficient with MATLAB. In such situations, we can increase the speed 
of the program using vectors in the algorithm. In doing so, the previous program can be 
written: 


function X=Binomial2(NbBinary, p) 
$Function that simulates the Binomial variable X 


X=sum(rand(1, NbBinary) <=p) ; 


end 


4.2.2 Infinite Discrete Random Variables: Poisson Distribution 


Define a Poisson random variable X with parameter à as follows: 


eT 


Prob(X = k) = , k=0,1,2,...,0. (4.11) 


k! 
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To generate this random variable we can use the previous technique and discard the events 
corresponding to high k, or in generating a series of independent random variables U; uniformly 
distributed on [0, 1], we can prove that: 


Prob(U; > p,U,U2 > p,...,U;U2...U% > p,U1U2...Uk+1 < p) 


p . 
= z CD logo), O<p <1. (4.12) 
In choosing p = e74, we get: 
Ui <e"*} = X =0, 
U, > e and 
UU, < e* | re 
U, > e>, l (4.13) 
UU: > e™>, 
i abo = X =k. 
U,U>...U, > e~* and 
U,U2...U p41 < e* 
This algorithm gives us 
us 
Prob(X = k) = ois (4.14) 


With MATLAB, we can generate such a random variable in the following manner: 


function X=Poisson (lambda) 
SFunction that generates a Poisson variable X 


X=0; 

U=rand; 

while (U>exp(-lambda) ) 
X=X+1; 
U=U* rand; 

end 


end 


4.3 SIMULATION OF CONTINUOUS RANDOM VARIABLES 


Let F be a given continuous cumulative distribution function and U a uniform random variable 
on [0, 1], then the cumulative distribution function of the random variable 


X =F '(U) (4.15) 


is F (this was shown in Chapter 2). Now this result helps us to generate variables with known 
distribution functions. 
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4.3.1 Cauchy Distribution 


The probability density function of the Cauchy distribution is: 


1 

——————— 4.16 
f= aap (4.16) 

and its cumulative distribution function is: 

1 f* d 1 
Fx(x) = = J. ra = [arctan(x) = = | = (4.17) 
Thus 

U = F(X) = X = Fy'(U) = tan (zU + =) (4.18) 


Hence, to generate X, we use the relation: 


X = tan (« (v + ;)) ; (4.19) 
1 
X = tan (« (v — ;)) ; (4.20) 


since the tangent function is periodic with period z. Such a variable can be generated with the 
following MATLAB program: 


or 


function X=Cauchy 
$Function that generates the Cauchy variable X 


X=tan (pi* (rand+1/2) ); 


end 


4.3.2 Exponential Law 


A random variable X is called exponential if its probability density function is: 
fx) =ae" 150), a > 0. (4.21) 


Its cumulative distribution function is: 


Fy(x)= (1 — ae~**) lix>0}- (4.22) 
Let 
a 1 
l-ae™ =u => x = —-— log(1 — u), (4.23) 
a 
hence 
1 
X= log — U). (4.24) 


a 
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Since U is uniformly distributed on [0, 1], then 1 — U is also uniformly distributed on [0, 1]. 
It follows that X can be generated by: 


1 
X =—-—log(V), (4.25) 
a 
where U is a uniform random variable on [0, 1]. In MATLAB, this procedure gives: 


function X=Exp (alpha) 
sFunction that generates the Exponential variable X 


X = -log(rand) /alpha; 


end 


4.3.3 Rayleigh Random Variable 


X is a Rayleigh random variable if its density function is equal to 


x2 
fx(x) = xe F lix>0}- (4.26) 


Its cumulative distribution function is then: 


2 


Foy = f jee aa Se = u => x = /—2log(l — u). (4.27) 
X can therefore be generated from a uniform random variable U(0, 1) by the relation: 
X = /—2log(l — U). (4.28) 
Since 1 — U also follows a uniform law on [0, 1], we can directly generate X by: 
X = y —2log(U). (4.29) 
The MATLAB program is 


function X=Rayleigh 
%Function that generates the Rayleigh variable X 


X = sqrt (-2*log(rand) ) ; 


end 


4.3.4 Gaussian Distribution 


Consider a standard Gaussian random variable X with zero mean and unit variance, denoted 
by N(O, 1). X can be generated following three different approaches. 


Method 1 


Let 12 random variables {U;};-1,....12 uniformly distributed on [0, 1], we study their sum: 


PEETA 


X =U +U2:+...+Un-—6. (4.30) 
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Since 


1 1 
E [Ui] = 5 and = Var[U;] = Dp (4.31) 


it follows: 
E[X]=0 and = Var[X]=1. (4.32) 


X is thus of zero mean and unit variance. Since X is the sum of 12 independent random variables 
having the same probability law, by virtue of the central limit theorem, we can consider X as 
a Gaussian variable. This way of generating a Gaussian random variable is interesting but not 
very precise since it is not possible to obtain rare events at the tail of the distribution. In fact, 
with this simulation Prob(X > 6) = 0 but for a standard Gaussian random variable Prob(X > 
6) > 107. 

The program in MATLAB is: 


function X=Gaussianl 
sGeneration of a Gaussian variable with method 1 


X = sum(rand(1,12))-6; 
end 


Method 2 


We saw in Chapter 2 that two Gaussian random variables X and Y with mean of 0 and variance 
of 1 can be generated from a Rayleigh variable with density 


2 
fo) = pe” 7 lip>0} (4.33) 


and a uniform random variable on [0, 277] of density 


1 
fa) = —, 0<0<2x. (4.34) 
27 
For the simulation, we set 
X = pcos(0) (4.35) 
and 
Y = psin(6). (4.36) 


We saw that p and 0 can be generated from two uniformly distributed random variables on 
[0, 1]: 


p = y —2log(U1) (4.37) 


and 


0 = 2m U3. (4.38) 
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Then we can generate two Gaussian random variables from two uniform random variables on 
[0, 1] with the following formulas known as the Box-Muller method: 


X = /—2log(U;) cos(27 U2) (4.39) 


and 


Y = /—2log(U;) sin(27 U2). (4.40) 


The program in MATLAB language is: 


function [X,Y] =Gaussian2 
SFunction that generates two Gaussian random variables with method 2 


Ul=rand; 
U2=rand; 
X=sqrt (-2*log(U1) ) *cos(2*pi*U2) ; 
Y=sqrt (-2*log(U1))*sin(2*pi*U2) ; 


end 


Method 3 


In MATLAB, the command randn directly generates a Gaussian random variable with mean 
of 0 and variance of 1. If we want to generate a matrix n x m with elements being Gaussian 
with zero mean and unit variance, the command is randn(n, m). 

Thus, in MATLAB, in order to generate a Gaussian random variable X with mean u and 
variance o°, the command is: 


function X=Gaussian3 (mu, sigma) 
$Function that generates a Gaussian variable X 
Swith mean mu and standard deviation sigma with method 3 


X = sigma * randn + mu; 


end 


We now compare all three methods of generating Gaussian random variables. We present a 
small program generating N Gaussian random variables following each of the three methods. 
We also compute their means and standard deviations. 


function Gaussian (N) 

S$Function that compares the different methods of generating 
a Gaussian random variable 

N must be an even number 


o 
ki 
o 

5 


%Method 1 
A = sum(rand(N,12),2) - 6; 
MeanA = mean (A, 1); 
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Table 4.1 Mean and standard deviation using the three methods for generating a 
Gaussian random variable 


Method 1 Method 2 Method 3 
N Mean Std. Mean Std. Mean Std. 
10 0.1066 1.4399 0.0007 1.0938 0.3265 0.1817 
10° —0.1719 0.9127 —0.1286 0.9488 0.0188 1.0673 
103 0.0517 0.9755 —0.0296 0.9886 0.0253 0.9677 
10* 0.0044 1.0155 —0.0023 1.0194 —0.0035 0.9967 
10° —0.0062 1.0009 —0.0048 1.0024 0.0015 1.0044 


10° 0.0019 0.9991 —0.0015 1.0005 0.0002 1.0009 


StdA = std(A); 
sprintf(’Method 1 \n Mean: %g, Standard deviation: %g’, MeanA, StdA) 


SMethod 2 

U1 = rand(1,N/2); 
U2 = rand(1,N/2); 
B1 = sqrt (-2*log(U 
B2 = sqrt (-2*log(U 
B = [B1,B2]’; 
MeanB = mean(B); 
StdB = std(B); 
sprintf ('Method 2 \n Mean: %g, Standard deviation: %g’, MeanB, StdB) 


)) .*cos (2*pitU2) ; 


1 
1)) .*sin(2*pi*U2) ; 


SMethod 3 

C = randn(N,1); 

MeanC = mean (C); 

Stdc = std(C); 

sprintf(’Method 3 \n Mean: %g, Standard deviation: %g’, Meanc, StdC) 
end 


We call the program with different values for N. Table 4.1 simply shows the simulated mean 
and standard deviation of a standardized normal random variable (mean zero and standard 
deviation one) using each of the three methods. 


4.4 SIMULATION OF RANDOM VECTORS 


In this section, we would like to generate a Gaussian random vector X having a known 
variance-covariance matrix Ay = cov(X, X T), Let Z= (Lye cc% Zn)' be an independent 
Gaussian random vector of zero mean and unit variance: Z ~ N (0, J), that is 


E[Z]=0 and Var(Z)=/, (4.41) 


where / is the identity matrix with 1 on the diagonal and 0 elsewhere. We want to use such a 
vector to create the random vector X. We include a review on matrix algebra in Appendix A. 
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4.4.1 Case of a Two-Dimensional Random Vector 
We want to generate the vector X with the following properties: 
2 
on a ~Nfo( kee ih ys (4.42) 
X2 7 (00102 o5 
Then 
E[X;]=0, E[X?7]=07, E[X1X2] = poyon. (4.43) 
Let the Gaussian random vector 
Z\ 1 0 
a) n (o (0 ')) (4.44) 
Then setting 
Xı = oi Zı 
; 4.45 
X2 = aZy + BZ> ( ) 
we obtain 
E[X1X2] = o, E[Z?] = g0. (4.46) 
However, we want to have E[X |X2] = po 102, hence w = po2. Moreover 
25 9) Dip) ae 2 
Var(X =a°+ BY =o, => B=+,/o;(1— 0°). (4.47) 
We choose £ = ,/o3(1 — p?). Hence 
Xı O71 0 Z| 
= : 4.48 
i) E ae. oe) 
Thus, we can generate such variables in MATLAB using the following program: 
function rep=RandomVector(sigmal, sigma2, rho) 
$Generation of a random vector of dimension 2 and correlation rho 
MatrixL = [sigmal, 0;rho*sigma2, sigma2*sqrt(1-rho%2)]; 
rep = MatriceL*randn(2, 1); 
end 
4.4.2 Cholesky Decomposition of the Variance-Covariance Matrix 
In the more general case of any n-dimensional random vector, if we take 
X=LZ =  cow(X,X')=E[LZZ'L'], (4.49) 
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where L is a matrix with the appropriate dimensions. Since E[Z] = 0 and cov(Z, Z') = /, 
then 


E[X] = 0, (4.50) 
cov(X, X')=LE[ZZ"]L' =LIL' =LL"'. (4.51) 
But we would like to have 
cov(X, X') = Ay, (4.52) 
thus 
LL' = Ax, (4.53) 


and L can consequently be obtained with the decomposition LL' (Cholesky decomposition) 
of Ax. Thus, knowing the decomposition LL" of matrix Ay, in order to generate a random 
vector X with covariance matrix Ax, we generate a Gaussian random vector Z with zero mean 
and identity covariance matrix, N (0, Z), and compute the matrix multiplication X = LZ. 

We only need to determine the decomposition LL' of Ay. Therefore, we note that if Ax is 
positive definite, that is 


Y'AxY >0 VY 40, (4.54) 


then the matrix L exists and is unique. The function chol(Ay) of MATLAB allows us to obtain 
the matrix L. 

In the case where the matrix Ay is not positive definite, we can reduce it into a matrix 
of a smaller dimension that is positive definite and on which we can use the Cholesky 
decomposition. 


Determination of L in dimension 2 and 3 


Let’s go back to the case of a vector of 2 random variables. Its variance-covariance matrix A is 
of dimension 2, and we want to determine its Cholesky decomposition, or its decomposition 


of the form LL’: 
= Oe (120102 
(120201 ae 


=i qj, anan ) (4.55) 


2 2 
421411 4% + an 
We identify the parameters appearing in the previous expression. 


2 2 
a, =0i > 41 = 0, 


411412 = P120102 > 412 = p1202, 
2 


2 2 2 2.2, 2 _ 
ahn tan = Oy => PRI tan = 07, 


> an = 02/1 — phy, 
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thus 
01 0 


L= Ny (4.56) 
P1202 4/1 — pho 


Now we complicate the problem and see what happens in the case of a vector with 3 random 
variables or a three-dimension covariance matrix. 


2 
ai pP120102 = 130103 
2 
A= | poo 03 (230203 
2 
1390301 230302 03 
a, 0 0 a1 an a3 
= |an an 0 0 an a3 
a31 a32 433 0 0 a3 
2 
ay 411421 411431 
2 2 
= | anan a5 +45 431421 + ananz |. (4.57) 


2 2 2 
431411 431421 +4324272 45; + 452 + 453 
As in the case of 2 variables, we get: 


4il = 0], 


a21 = (1202, 


/ 2 
an = ozy l — ph, 


and for the other parameters, we have 


431411 = p130301 = 431 = P1303, 


p320203 — 431421 
431421 + 432422 = 0230203 => 432 = ——————— 


a22 
(23 — P13P12 
—= 03 ————_—_—_—_ 
Li Pin 
and 
a3; + 3) +43; = OF > 433 = y 03 — 43, — a5 
> (023 — 013012)" 
= 03 1 13 2 à 
1— ph 
Hence 
O1 0 0 
po y1- (4202 0 
L= ‘ (4.58) 


£23 — P13P12 2 (p23 = pipen? 
———_—T{!_ & (oy 


3 P13 2 
Vl- ep TR 
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Example 4.1 Consider matrix A given by the following values 


oi = 0.10, 
o = 0.20, 
o3 = 0.25, 
pı2 = 0.75, 
pı3 = —0.50, 
p23 = —0.35. 
Hence matrix A is 
of p120102 130103 
A = | (120204 o (230203 
130301 230303 oF 
0.01 0.015 —0.0125 
= 0.015 0.04 —0.0175 |. (4.59) 


—0.0125  —0.0175 0.0625 
The MATLAB command to get L is 


sCommand to get the Cholesky decomposition 


chol (A); %This gives the upper triangular matrix 
L=chol(A)'; %To get the lower triangular matrix 


The result is 


0.1000 0 0 
L= 0.1500 0.1323 0 : (4.60) 
—0.1250 0.0094 0.2163 


4.4.3 Eigenvalue Decomposition of the Variance-Covariance Matrix 


The alternative to the Cholesky decomposition is the eigenvalue decomposition, also known 
as principal components analysis. It is slightly more difficult to program than the Cholesky 
method, but it works even for matrices that are not positive definite. This means that the method 
works for variance-covariance matrices with many risky factors (even hundreds). 

The eigenvalue decomposition also has the advantage that it can give intuition on the risks’ 
random structure, which can help reduce the number of necessary simulations. 

However, the eigenvalue decomposition can be difficult to obtain if different parts of the 
matrix have been generated from data spanning different time periods, since the inconsistency 
in data can produce negative variances for some principal components. 

The eigenvalue decomposition works in the following way. Find two matrices D and P 
satisfying equation: 


Var(X) = Ay = PDP", (4.61) 
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where Ax is the variance-covariance matrix, D is a matrix such that the only non zero elements 
are the ones on the diagonal, and P is an orthogonal matrix, that is: 


I=PP'. (4.62) 


This method rests on the eigenvectors forming a basis, which is guaranteed in this case since 
the matrix Ay is symmetric. The elements on the diagonal of matrix D are the eigenvalues of 
matrix Ay. Knowing that D is a diagonal matrix with all positive elements 


d 0 0 
0 0 d 


the matrix Ay can be partitioned in two, such that 
Axy=LL', with L=PVD, (4.64) 
and 
vd 0 0 
JD=| 9 > o b (4.65) 
0 O Jd 


From this decomposition of matrix Ay, the vector X can be obtained exactly as in the case of 
the Cholesky decomposition in setting 


X=LZ, (4.66) 


with the vector Z being composed of n independent random variables with unit variance, that 
is, Var(Z) = 1. 


Example 4.2 If we take the matrix A given previously, that is 


0.01 0.015 —0.0125 
A= 0.015 0.04 —0.0175 |. (4.67) 
—0.0125 —0.0175 0.0625 


We obtain the following matrices P and D 


0.9367 0.2285 —0.2652 


P = | —0.3359 0.7998 —0.4975 (4.68) 
0.0985 0.5551 0.8259 
0.0033 0 0 
D= 0 0.0321 0 ; (4.69) 
0 0 0.0771 


These matrices can be computed in MATLAB with the command 


sCommand giving the eigenvalues and eigenvectors 
Sof matrix A 


[P,D] = eig(A); %P->eigenvectors’ matrix 
%$D->corresponding eigenvalues’ matrix 
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Matrix L is obtained with the command 
sComputation of L with matrices P and D 


L=P*D.*(0.5); 


and thus 


0.0539 0.0410 —0.0736 
L = į —0.0193 0.1434 —0.1381 (4.70) 
0.0057 0.0995 0.2293 


with the decomposition 


A=LL'. (4.71) 


4.4.4 Simulation of a Gaussian Random Vector with MATLAB 


To simulate an independent Gaussian random vector of dimension n with mean 0 and variance- 
covariance matrix J (denoted by N(0, /)) in MATLAB, we use the command randn(n, 1). To 
simulate the vector Y = h(X), we simulate X and we apply the function / to it. 

Hence, to simulate the Gaussian random vector Y with mean u and the positive definite 
matrix A as its variance-covariance matrix, we express Y as a function of an independent 
Gaussian random vector N(0, I) as follows. 

We express A = JAVA, then Y = VAX + u is the Gaussian random vector N (u, A). In 
MATLAB, we will use chol( A) to get VA. Hence in MATLAB, the command for simulating 
a Gaussian random vector N(u, A) with dimension n is 


function Y=GaussianVector (n, Lambda, mu) 

$Function that generates a Gaussian random vector 
$n: Vector’s dimension 

SLambda: Covariance matrix 

smu: Vector of means 


Y = chol (Lambda) ’*randn(n,1) + mu; 


end 


4.5 ACCEPTANCE-REJECTION METHOD 


We want to simulate a random vector X having probability density function f(x). We suppose 
that there is already a program on the computer allowing us to generate a random vector Y 
of same dimension of X, with a probability density function g(y) satisfying the following 
inequality: 7 


f(x) < kg), (4.72) 
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where k is a given positive constant. We set 


soe f(x) 


= ; 4.73 
— kg(x) R 


Generation algorithm 
1- We generate the vector Y , with probability density function g(y) and a random variable U 
independent of Y ,, uniformly distributed between 0 and 1. 7 
2- If U; < a(Y,), we choose X = Y |, otherwise we reject Y} and re-generate new Y and U 
until U» < a(¥,,), then we will choose X = Y ,,. This way, the generated random vector X 
follows the probability law f(x). 

In order to illustrate this result, we consider the event {X € E}, where E is a subset of the 
real axis. Because Y „ and U,, are all independent, we immediately note that the events 


X =Y, {X=¥)},... {X= Vy} (4.74) 
are mutually exclusive, which gives 


Prob(X € E) = Prob({X =Y,,X € E}or... 
or{X =Y,,X € E}or...) 


m?’ 


= y Prob({X = Y „, X € E}). (4.75) 


m=1 


We know that 


Prob({X = Y 


om? 


X € E}) = Prob({U; > a(Y,)} N {U2 > a(Y,)} N... 
NA Uma > ¢Ly EAU < oY) 
NY, € E}) 
= Prob({U; > a(¥))}) x 
Prob({U2 > a(Y,)}) x... 
x Prob({Um—1 > oY ,-1)}) 
x Prob({Um < aË p) Yp € E) 


= (1 — py” 'Prob¥ „ € E | Um < oŒ „)) 


x Prob(Um < a(Y,,)), (4.76) 
where p = Prob(U < a(Y)). 
Since 
Prob(Y „ € E | Um < a(Y,,,)) (4.77) 
is identical to 
Prob(Y € E |U <a(Y)), (4.78) 
we have 
Prob({X = Y „, X € E}) = p(l — p)"™'ProbY € E | U < a(¥)). (4.79) 
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Finally, 


Prob(X € E) = > p(1— p)" Probl € E | U < a(Y)) 


m=1 


= pProb(Y € E | U <a(¥)) YC - py"! 
m=1 

= Prob(Y € E | U < a(Y)) 

_ Prob € E,U <a(Y)) 

ii Prob(U < a(Y)) 


(4.80) 


To determine Prob(X € E), we must compute Prob(Y € E, U < a(Y)) and Prob(U < a(Y)). 
We have 


ProbU < a0) = f ProbU < aty)e(yedy 


= f aod 
fO) 
= | —=g0)d 
io) y 
1 
sz (4.81) 
and 
a(y) 
Prob € EU sat = f gdy | du 
E ` 0 
= f aO 
fO) 
= | —=g(y)d 
S oy y 
1 
=e f fody. (4.82) 
E 
Which gives 
Prob(X € E) = / f@)dx. (4.83) 
E 


This relation shows that the probability density function of X is effectively f(x). 


Remark 5.1 We have shown that Prob(U < a(Y)) = i. which means that the rate of rejection 
is very high when & is large. When k approaches 1, the rate of rejection decreases rapidly. 
Thus, in practice, we should choose a function g(y) with values of similar size to f(x). 
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4.6 MARKOV CHAIN MONTE CARLO METHOD (MCMC) 
4.6.1 Definition of a Markov Process 


Let a random process {X,, n = 0, 1, 2,...} be given. Refer to Chapter 7 for a more detailed 
analysis of random processes. 

We suppose that X, take their values in the states’ set Q. We say that the process {X,, n = 
0, 1, 2,...} is a Markov chain if 


Prob(X 7,41 = Xn+1 | Xn = Xn, Xn-1 =Xn-1,---,X0 = x0) (4.84) 
is equal to 
Prob(Xn41 = Xn41 | Xn = Xn). (4.85) 


It means that the probability that the process is in state x,4; at n + 1, knowing that is was 
in state x, at n, is independent of everything that happened before n. All that matters are the 
successive moments n andn + 1. 

Moreover, the Markov chain will be called homogeneous if 


Probl Xari = y | Xn =X) = Pry. (4.86) 


That is, the probability of being in state y knowing that the process was in state x previously, 
Px, 18 totally independent of n. All that matters is the evolution between two successive dates. 


4.6.2 Description of the MCMC Technique 


We will show in Chapter 5 that the so-called Monte Carlo method can be used to estimate 
the mathematical expectation of a random quantity g(X), where X is a random variable with 
probability density function fx(x): 


1 N 
EUIS =D 4X), (4.87) 


n=1 


where the samples X, are independently generated by computer. 

It is not always easy to directly generate X, if fy(x) is complicated. Because the estimator 
(6.4) doesn’t require the independence hypothesis for the X,, we can use any method to 
generate them, provided that X, has f(x) as its density function. This explains the idea of 
using a Markov chain. 

Suppose that we have a homogeneous Markov chain such that the successive states {Xo, 
X,,X2,...} are characterized by the transition probability 


Prob(Xn41 | Xn). (4.88) 


The homogeneity ensures that Prob(X „+1 |X;,) doesn’t depend on n. 

How does the choice of Xo affect X,,? Expressed differently, what is the form of fx, (x|xo)? 
It is known that, under very general regularity conditions, fy, (x|xo) becomes independent of 
Xo when n is very large, that is, when the chain is in a stationary regime. 

We denote by x(x) the density function of X in the state when the chain reaches the 
permanent regime assuming that we can construct the chain such that x(x) is identical to the 
desired f(x). We let the chain evolve until it reaches the stationary regime (i.e., large M) and 
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then use the following simulated states to estimate E[q(X)]: 


N 


dan). (4.89) 


n=M-+1 


Elq(x)] © 
KONS —— 
This result of using part of the simulated states to estimate the expected value is known under 
the name “ergodic mean”. 

The most widely known method of generating a Markov chain with x(x) identical to fx(x) 


is the Metropolis-Hastings algorithm. Let g(x|y) be any conditional density function. 
Let’s define 


AOU (4.90) 


* fx(g(yix) 
Assuming that at any time n, the chain is in state X, = x,, in order to construct the desired 
Markov chain we generate two random variables. The first, Y, is characterized by the density 
function g(ylx,,) and the second one, U,,, is independent of the first and is uniformly distributed 
between 0 and 1. X„+1 is then defined by 


Y, if U <a, y); 
Xn, if U > a(x, y). 


a(x, y) = min (1 


Xu =| (4.91) 
As seen in Section 4.5, this is precisely the rejection method. 

It is important to note that since there is no predefined form for g(ylx), we will choose a 
particular formulation in order to suit the model under study. 


Notes and Complementary Readings 


For further readings, readers can eventually refer to the following publications on Monte Carlo 
simulations and numerical methods: Kloeden, Platen and Schurz (1997), Press, Teukolsky, 
Vetterling and Flannery (1992), Rubinstein (1981) and Seydel (2002). 

For a quick learning of MATLAB, the reader can refer to Etter and Kuncicky (2002) 
and Pratap (2002), and for complementary references on random variables generation with 
MATLAB, we suggest Martinez and Martinez (2002). 

Useful references on Markov chains and random processes are the books of Ross (2002 a 
and b) and Kloeden and Platen (1992). 
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____ _ Foundations of Monte Carlo Simulations — | 


In finance, asset pricing generally consists of computing the mathematical expectation of 
the payoffs. In this chapter, we describe how simulation techniques can be used to perform 
approximatively these mathematical expectations. This involves the notion of approximation 
precision. 

Several techniques exist that enhance the precision of the approximations and reduce the 
computation time. We introduce the Barraquand (1995) quadratic resampling technique used 
to improve the estimation precision for the variance-covariance matrix pervasive in financial 
modeling. We also introduce techniques that reduce the computation time and the variance 
reduction techniques such as the antithetic variables technique, the control variates technique 
and the importance sampling technique. 

We end the chapter with application examples which constitute exercises for the reader. The 
MATLAB® programs are provided as solutions for the application cases. 


5.1 BASIC IDEA 


The Monte Carlo (MC) method is a numerical calculation method used to perform numerical 
computations of functions of random variables. Its origin can be traced back to when Laplace 
and Bouffon computed the numerical value of 2 using a random experiment. Later, during 
the building of the atomic bomb in Los Alamos, Newman and Ulam developed this technique 
extensively to calculate complex integrals. 

The approach consists of performing a sequence of experiments and taking their average 
value. To clarify the concept, let us examine the following example: 


Example 1.1 Consider the following integral: 


1 
r= | cos(27x)dx. (5.1) 
0 


In general, we can calculate the expectation using the Monte Carlo method. To do that, we 
define Z as the mathematical expectation E[ . ] of a random variable as follows: 


T = E [cos(27X)], (5.2) 


where the random variable X follows a uniform distribution U (0, 1). 

We generate N independent values of a uniform random variable over the interval [0, 1]. 
The strong law of large numbers allows us to estimate Z by taking the arithmetic average 
of the realized results: 


N 
a 1 
In = N 2 cos(27 xi), (5.3) 
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where x; are the realizations of uniform random variables. A computer program to perform 
this task can be the following: 


function I=IntegralMC (N) 
Function to calculate the integral I by the Monte Carlo method 


SN: Number of simulated points to approximate the integral 


X = rand(N, 1); 
I = mean(cos(2«*pixX)); 


end 


When N goes to infinity, the estimator T, n converges to the exact value of Z. Table 5.1 
gives an illustration of that convergence to the theoretical value of zero. 


Table 5.1 Estimation of an integral using Monte Carlo 


simulations 
Number of experiments: N ty 
10 —0.1005 
100 0.0289 
1000 0.0317 
10 000 0.0110 
100 000 —3.5618 x 10-8 
1 000 000 —7.5336 x 1078 


In the above example, we can clearly see that in the computation process, we made two 
fundamentally important assumptions: 


(i) We succeed in generating by computer a uniform random variable U(0, 1), and 
(ii) The realizations x1, x2, . . . , Xy are obtained from random variables X1, X2,..., Xx which 
are statistically independent and identically distributed. 


To check whether these random variables X1, X2, . . . , Xy are uniformly distributed, we only 
need to examine their histogram. It consists of dividing the interval formed by the simulated 
values into sub-intervals, called bins. The histogram is the graph of the step function with 
values equal to the number of values in each bin. 

Depending on the length of the bins, we obtain different histograms. If the histogram remains 
roughly constant, we can conclude experimentally that the random variables are uniform over 
the interval [0, 1]. 

To verify the statistical independence of the generated samples, we plot the points (Un, 
Un+1). If the points appear to be widely spread in the unit square, we can assume the samples 
generated to be statistically independent. 

If the data are not appropriately distributed over the square, we can use correction techniques 
such as the shuffling technique (see for example Knuth (1981) and Rubinstein (1981)). How- 
ever, since the MATLAB generator rand is sufficiently satisfactory for most of our financial 
applications, we do not resort to these techniques. 
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5.2 INTRODUCTION TO THE CONCEPT OF PRECISION 


Let X be a random variable with probability density function fy(x). We would like to calculate 
E[g(X)] where g(.) is a given function. The numerical value of E[g(X)] is often difficult to 
obtain. One then needs to estimate this value using the Monte Carlo simulation technique, 
which consists of generating N independent samples X; of the random variable X, and use 
them to evaluate the estimator. 

The expectation = E[g(X)] is approximated by the arithmetic average 


1 
h= — Xi). 5.4 
N g(Xi) (5.4) 
From the law of large numbers, we have: 
Prob (|2 — A| < Eon) >1— A (5.5) 
= Ne? 
It means that / lies in the interval 
h — eon < h < fi + eon (5.6) 


with probability at least 1 — a We choose a confidence level œ, between 0 and 1, so that the 
probability that / is in the interval 


[h — son, h + eon] (5.7) 
is greater or equal to 1 — a, which requires a minimum number of simulations 
1 
Ne (5.8) 
ae 


In statistical terminology, it means the hypothesis that h is in the interval for N > b is 


acceptable with a confidence level œ and the interval 
[h — son, h + eon] (5.9) 


is called confidence interval at 100(1 — a)%. 
Note that o, is not known. We can, however, estimate its value the same way as we did for 
the mean. In other words, in practice, one has to replace o} by its estimator: 


1 N 
& = — Y (a) - h). (5.10) 


Example 2.1 Let X be a log-normal random variable with probability density function: 


OE Sgal for x > 0, (5.11) 


e 
XN 200 


with o? = 0.1 and m = 5. We would like to calculate the following quantity: 
c = E [max(0, X — 110)]. (5.12) 


This example is an illustration of a European call option value calculation, which we discuss 
in more detail later when we introduce the applications in finance. 
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To generate X, we first generate a Gaussian random variable Z ~ N(O, 1) and next 
Y = øZ + m, where Y is a Gaussian random variable with mean m and variance o”. The 
random variable X = e” is exactly the variable with the log-normal distribution. 

In MATLAB, we have: 


sGeneration of the random variable X 


X = exp(sqrt(0.1) » randn + 5); 


We generate N values of this type and then calculate the call price defined by (5.12). 
Here is the MATLAB program for that exercise: 


function Call=CalculateCall (NbTraj) 
$Function to price a call option 
$NbTraj: Number of generated points to compute the expectation 


oe — an Oe 

for i=1:NbTraj 
x = exp(sqrt(0.1) » randn + 5); 
c = c + max(0,x - 110); 

end 

Call = c/NbTraj; 


end 


This subroutine is not efficient because of the “do-loop”; we can write it in a vector 
format as follows: 


function Call=CalculateCall2 (NbTraj) 
$Function to price a call option 


$NbTraj: Number of generated points to compute the expectation 


x = exp(sqrt(0.1) * randn(1,NbTraj) + 5); 
Call = mean( max(x - 110,0)); 


end 


The number of simulations required under the strong law of large numbers can be improved 
by using the central limit theorem presented in Chapter 3. Let 


1 N 
Z= Xi)— hA), l 
z MEAG )—h) (5.13) 


with h = E[g(X)]. Z is a random variable with zero mean and unit variance. Since 


VN 1 č 
Z=- g2 (e(Ki)—#) 


h ii 


= iah (5.14) 
h 


oy 


i=1 
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it implies that 


Prob (|h — h| < Eon) = Prob (Z ia h| < Do) 


Oh Oh 
= Prob (iz < eV/N) (5.15) 


When N is relatively large, Z can be approximated by a Gaussian random variable with zero 
mean and unit variance. Hence 


rT ee ae 
Prob (1z zE N) -re e Tdx 
J 21 —eJ/N 


2 i ie 
= — e u 
VT Jo 


= ert (J5 5.16 
=ef |z] (5.16) 


We can obtain the number of simulations from the confidence interval as follows 
Prob (ızı < eV/N) > 1—a =erf(8), (5.17) 
where 


b = af! (1 — a). (5.18) 


N 
erf (5) > erf(£). (5.19) 


Since erf is an increasing function, it yields: 


Te 5.20 
E 5 28. (5. ) 


2: 
N>2 (£) (5.21) 


Hence, if « = 0.05, we have 8 = erf—!(0.95) = 1.96. For e = 0.01, we have 


It implies 


thus 


2(1.96)? 
N= (1.96) 
10-4 


This is an improvement compared to 200 0000 obtained using equation (5.8). 


= 76,832. (5.22) 


5.3 QUALITY OF MONTE CARLO SIMULATIONS RESULTS 


In the previous section, we have shown how Monte Carlo simulations can be used to estimate 
the mathematical expectation of functions of random variables by computing the average of 
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large samples obtained from the simulations. Therefore, if Y = g(X), the following expectation 


h=E[Y)=El[g(X)] (5.23) 
can be approximated by 
1 
yX). (5.24) 
i=l 
Hence, when the number of simulations is large, it is possible to replace the precision results, 
P 1 
Prob (|h — h io 3.25 
rob (| | < eon) > We (5.25) 


by the relation 


Prob (|h — h f ye 5.26 
rob (|h — h| < eon) > er eva (5.26) 


However, if of is generally not known, we then use its empirical value: 


N 
ç? = > (g(X;) —f)’. (5.27) 
i=l 
In general, the precision on GP, is relatively complex; however, if g(X;) is Gaussian, G; tends 
toward of for very large N. In practice, the Gaussian assumption on g(X;) is hardly acceptable, 

and we therefore need to find a way to approximate this Gaussian case. 
This consists of performing M calculations, each corresponding to N Monte Carlo simula- 
tions. Hence, we obtain 


A Sai 
h= 2, 8X4) (5.28) 


where Xz; is the i th simulation result of the k™ calculation. This means that (ni, Sayer hu) 
are estimators of h. From the central limit theorem, when N is relatively large, he can be 
considered as Gaussian. 

We can apply the precision concept discussed above to each hy. Exact o,, being unknown, 
we use instead Gh. 

With M realized experiments, all the results hy can be considered as independent Gaussian 
random variables with the same statistics. The estimator / can be estimated as: 


1 
h= H X k (5.29) 
k=1 
and the variance of / is: 
AQ 1 = a 2 
=r Yo (ie - hy. (5.30) 
k=l 
Hence, when M is fairly large, to obtain the Gaussian characteristic of hr, the variable 
h—h 
Ty = (5.31) 


a2 


M 
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follows a Student’s t-distribution with M — 1 degrees of freedom. The Student’s t-distribution 
is used to cope with uncertainty resulting from estimating the standard deviation from a sample. 
This distribution is often used for confidence intervals and hypothesis tests. 
The precision on h is now defined by: 
Prob (|h — h| < a) = Prob (ITu] < t), (5.32) 


where 


pa, (5.33) 


T? = , (5.34) 


with the mean họ satisfies the inequality: 

ITE] < Íi—-a,M-1> (5.35) 
where the values of the parameter t are presented in statistical tables. If the inequality is 
violated, Ao has to be rejected, unless we accept it on the base of the test. Thus, the confidence 
interval at the level œ is defined by: (h —a, h + a), where 


a = ti—a,M-14| —. (5.36) 


All the values họ in this interval can be considered as the estimator. 


Example 3.1 We would like to estimate 
E [max(0, X — K)]. (5.37) 


To achieve that, we run 30 times 100 variables (NbTraj = 100, NbSeries = 30). We then 
calculate 
100 


T 0p s) (5.38) 
with 
2(Xk i) = max (0, Xki— K) . (5.39) 
Hence 
a ieee 
h = — hg. (5.40) 


74 


Stochastic Simulation and Applications in Finance 


o 


We then proceed with the Student test. Let’s start by generating the variables Xx with 


2 = 0.1 and m = 5. The MATLAB program is the following: 


function X=GenerateVariables (NbSeries,NbTraj) 
$Function generating log-normal variables Xk,i 
sNbSeries: Number of batches of variables 
$NbTraj: Number of variables per batch 


X = exp(sqrt(0.1) » randn(NbSeries,NbTraj) + 5); 


end 


Next, after calculating 9(X;,) = max(0, Xi, — 150) and summing, we obtain Ag and h. 


We then calculate Ge. The complete MATLAB program is 


function [hHat,sigmaHat] =CalculateParameters (NbSeries,NbTraj,alpha) 
S$Function calculating the values of hHat and sigmaHat 

sNbSeries: Number of batches of variables 

SNbTraj: Number of variables per batch 

Salpha: The confidence level of the confidence interval 


X = exp(sqrt(0.1) * randn(NbSeries,NbTraj) + 5); 


%šThe mean of hk is taken with respect to the 2nd dimension 
(columns) 
hk = mean(max(0,X - 150),2); 
hHat = mean(hk,1); 

sigmaHat = std(hk) ; 


Interval=[hHat - tinv(l-alpha,NbTraj-1)*sigmaHat/sqrt (NbTraj),... 
hHat + tinv(1l-alpha,NbTraj-1)*sigmaHat/sqrt (NbTraj)]; 


end 


As we can see, to find ti-a w—1, MATLAB provides the following function 


SMATLAB function for the inverse function of the Student 
distribution 


t = tinv(l-alpha,M-1); 


The results obtained with different values of œ are presented in Table 5.2. 


Table 5.2 Confidence intervals of the Monte Carlo simulations 


h Ge a ti-a M-1 Interval 


21.7806 12.9871 0.1000 1.3114 [20.9177, 22.6434] 
21.7806 12.9871 0.0500 1.6991 [20.6626, 22.8985] 
21.7806 12.9871 0.0100 2.4620 [20.1607, 23.4005] 
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5.4 IMPROVEMENT OF THE QUALITY OF MONTE CARLO 
SIMULATIONS OR VARIANCE REDUCTION TECHNIQUES 


As we have discussed above, to obtain adequate precision we need to significantly increase the 
number of Monte Carlo simulations. With the modern computing power available nowadays, 
the number of simulations is no longer a major concern. 

However, in some complex situations, where the whole process (instead of a single variable) 
needs to be simulated dynamically, the computation time can become an important issue. Then 
ways must be found to reduce the required number of simulations. 

We examine four commonly used techniques: the first is used to improve precision and the 
remaining three to reduce the computation time. 


5.4.1 Quadratic Resampling 


When we generate random variables, we obtain sample statistics which do not in general coin- 
cide with the model statistics. To exploit the properties of the theoretical formulas containing 
these statistics, we need to transform the data in order to establish the equality between the 
sample parameters and the theoretical parameters. The parameter invariably encountered in 
the calculations is the covariance matrix of a given random vector. 

The quadratic resampling technique introduced by Barraquand in 1995 is very useful for 
probabilistic multidimensional integration. The technique is built as follows. 

Let X = (X1,...,X A be a n-dimensional random vector with mean 


my = E [X] = (Œ [X1], ..-, E Xa)! (5.41) 


and covariance matrix 


= E [XX'] -mymy. (5.42) 


1 M 
fiy = 37 DX (5.43) 
k=1 
and 
x Lis a) (a) (5.44) 
z M k=1 i n l 


where X* is the vector obtained from the k™ simulation. Developing the expression of the 
sample variance, we obtain: 


M 
> x" (X5) — my. (5.45) 


From the law of large numbers when M is very large, My and A x become close to my and 
Ax with a greater precision. 
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When M is small, this precision can be very poor. However, we can modify the data X such 
that the sample mean and covariance matrix coincide with the statistical mean and covariance 
matrix. In practice, A x is a symmetric and positive definite square matrix. The square root of 
the matrix A x exists and is regular. 


Let’s define the matrix 
=i 
H = Ax (Vax) (5.46) 


and the following vector 


Y =H (X ~fiy) + my. (5.47) 


Yt = H (X*— fy) + my (5.48) 


1 M 
~ k 
my = — J YF. (5.49) 


Developing this sum, we obtain the equality between the sample mean of Y and the statistical 
mean of X: 


M 
Xo rt = my. (5.50) 


Similarly, we can calculate the sample covariance matrix of Y as follows: 


it T 
~ k a k me 
Ry = g (vt ~ fy) (2 - fy) 


> 
ll 
an 


=H (xt amy) (xt) 
M T a E 
= HAyH" 
(ES (Ja) (V5) 
= Ax. (5.51) 


This transformation implies that the sample mean of Y* is identical to the statistical mean 
of X and the sample covariance matrix of Y` K is identical to the statistical covariance matrix 
of X. 
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To perform this transformation, we need to know the statistical covariance matrix of X. 
Under such situations, this transformation improves the precision of the calculations obtained 
by Monte Carlo simulations. 


5.4.2 Reduction of the Number of Simulations Using Antithetic Variables 


To obtain the mean of a random variable with larger variance, one needs to run a very large 
number of simulations to attain the desired precision. In contrast, when the variance is small, 
the number of simulations needed is low. Therefore, it is imperative to find transformations in 
order to reduce the variance of the obtained variable. 

The simplest variance reduction technique is the use of antithetic variables. This consists 
of generating N random variables X; and building N other variables from these with the same 
distribution but negatively correlated with the generated variables. 

It is easy to show that the variance of the sample mean obtained from the 2N variables is 
lower than the variance obtained from 2N independent variables. Indeed, consider 


N 
Ma = z Y (Xr+ Xf). (5.52) 


where 


= 2 =] 
= | oes (5.53) 
The variance of the sample mean Ma is equal to: 


Var(Ma) = E [Ma — mx) | 


ine i< 
= (KE -mo gy tm] $ (5.54) 


Since X; are independent as well as the X7@, all the 2N variables X; and X ‘ are independent 
except for the pairs X; and X7. We can then develop the expression for the variance to obtain: 


N 
Var(Ma) = eer} 
mh Iv? 
N N 
poy 2 [(X_ — mx) (X4 —mx)]. (5.55) 
k=1 j=l 
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The expectation of the last term in brackets is zero except for k = j where 


E[(X; — mx) (Xf —my)] = —ox. (5.56) 
Hence, 
Var (Ma) a E eee (5.57) 
ar(Ma) = o Oy = Oy. $ 
ONS CAN: SF 4N* 


Meanwhile, the sample variance of 2N independent random variables is equal to 


1 2 


We can therefore note that we have improved twofold the variance of the sample mean by only 
generating half random variables. 


Remark 4.1 In some cases, we cannot obtain a perfect antithetic variable, i.e. 100% negatively 
correlated with the generated variables. In this kind of situation, we will choose variables with 
same distribution and partially negatively correlated with the generated variables. 


Example 4.2 Let’s consider a random variable X uniformly distributed over the interval 
[a, b], i.e. U (a, b). The variable X“ = b + a — X is a perfect antithetic variable of X. 
Indeed, we can see that X° is uniformly distributed over the interval [a, b] and is perfectly 
negatively correlated with X. 


Example 4.3 Let’s consider a Gaussian random variable X with mean m and variance 
o, i.e. X = N(m, o°). The variable X? = 2m — X is an antithetic variable with the same 
distribution and perfectly negatively correlated with X. 


Example 4.4 Let’s consider an exponential random variable X with probability density 
function: 


fx Œœ) = e™ lizo} (5.59) 


We observe that, for this distribution, we cannot obtain a perfect antithetic variable from 
X. However, we can generate X from a uniform random variable U (0, 1) with the transfor- 
mation X = —log(1 — U). 

We then use the transformed variable X = — log(U ). This transformation gives a variable 
X with the same statistics as X but its correlation with X is not —1. Indeed, my = 1 and 


E [X —my)(X — mx)| =E [dog (U) + 1) dog (1 — U) + 1)] 
= 2E [log (U)] + E[log(U) log (1 — U)] + 1. 
(5.60) 
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5.4.3 Reduction of the Number of Simulations Using Control Variates 


The objective of simulations is to reduce the variance of the estimator used. We have just 
shown how the use of antithetic variables can significantly reduce the variance and, in the 
best case, we have a variance reduction of 50%. The antithetic variables technique exploits 
the negative correlations properties of the variables. Hence, if one can use other variables, not 
necessarily antithetic but negatively correlated with the generated variables, it is possible to 
reduce the variance of the estimator. 

Let’s consider a random variable X and another random variable Y correlated with X. We 
call Y, control variate of X. Assuming the statistical mean of Y is known, let’s construct the 
following new variable: 


X* =X +a -E[Y)). (5.61) 


We observe that the statistical mean of X* is identical to that of X: my = my». Hence, instead 
of estimating the mean of X, it is equivalent to estimate the mean of X*. 
The objective is to choose a by reducing the following variance of X*: 


Var(X*) = E [(x* = mx-) | =E[(X +a -E[Y) — my] 
= Var(X) + a7Var(Y) + 2aCov(X, Y), (5.62) 
where 
Cov(X, Y) = E [(X —my)(Y —my)]. (5.63) 


The optimal choice œ* is such that: 


dVar(X*) 
ar ee 0 = 2a*Var(Y) + 2Cov(X, Y)=0. (5.64) 
a 
It implies 
Cov(X, Y 
p= _ Cov(x, Y) (5.65) 
Var(Y ) 


With this choice, the variance of X* becomes: 


se: [Cow(X, Y)? 


This shows that Var(X*) is lower than Var(X) which is our objective. 
This analysis can also be applied with multiple control variates Y,, Y2,..., Yp of X. The 
new transformed variable is: 


X* =X +0 Yi — E Y1) +: + 0% Ve — E Y). (5.67) 
Using vectorial form, we can rewrite the equation as follows: 
X* = X +a' (Y - my), (5.68) 


where g = (@1,..., 4)", Y = (Y1, ..., Y)! and my = (E [Y1], ..., E YRD". 
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The variance of X* becomes: 
Var(X*) = E [(x* = mx) | 


= E| (x -mx +a" Œ — my) 


x (X -mx + Œ - my)"a) |, (5.69) 
Developing the right-hand side product yields: 
Var(X*) = Var(X) +a! Aya + 2a' P, (5.70) 
where 
P=E |v- my) (X — mx)| = (Cov(X, ¥1),..., Cov(X, ¥,))". (5.71) 


Assuming the covariance matrix of vector Y to be positive definite, the optimal choice for 
the vector œ corresponds to the solution of the equation: 


Var(X* dVar(X* Var(X*)\ | 
ava = ( as geen ’) =0. (5.72) 
da day day 


Applying the differential rule of a function with respect to a vector, it yields: 
dVar(X*) 


=2P+2Aya=0. (5.73) 
da = 


Since the matrix Ay is positive definite, it is therefore invertible and regular, and the solution 
of œ becomes: 


a* = —Ay'P. (5.74) 


With this optimum choice, the variance of X* becomes: 
Var(X*) = Var(X) — P'A;'P. (5.75) 


Since the matrix Ay is positive definite, its inverse is also positive definite. Thus, under 
the quadratic form, P TAF" P is a positive quantity, which means that Var(X*) is lower than 
Var(X). In the next section, we provide an illustrative example and the MATLAB program for 
the implementation of this technique. 


5.4.4 Importance Sampling 


The sample mean is the estimator commonly used in the Monte Carlo method. We have also 
seen that the statistical precision (measured by the variance of the estimator) is inversely 
proportional to the variance of the random variable where we wish to calculate the mean. 
For rare events, this variance is too low, which means that we will need a very large number 
of simulations to reach the desired precision. This number of simulations required for pre- 
cision purposes becomes prohibitive. In these situations, the classical Monte Carlo method is 
impractical. 

The importance sampling technique is a method used to overcome this problem. It simply 
consists of transforming the variables in order to reduce the variance of the simulated variables. 
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The Approach 


The principle of the method is explained as follows. Let’s consider a random variable X with 
probability density function fy(x) and Y a transformation of X: 


Y = h(X). (5.76) 
The mathematical expectation of Y is given by 
E[Y] = my = fhoras, (5.77) 
and its sample mean obtained with N samples X; is: 
we 
EY] = ñy = 5) AXo, (5.78) 
k=1 


where the sample variables X; are independent and have the same probability density function 


fx). 
Let gy(y) be a given probability density function. Analytically, equation (5.77) can be 
rewritten as follows 


fx) 
gy(y) 


The integral (5.79) can be interpreted as the mean value of the random variable Z obtained 
from the transformation of the random variable Y with the probability density function gy(y), 
where 


El] = / no pray. (5.79) 


x(Y) 
Z(Y) = mane (5.80) 
Hence 
E[Y] = E[Z]. (5.81) 
The sample mean of Z is therefore 
A wa E 
E[Z] = ñz = =, 2 ZY;), (5.82) 


where the N* random variables Y, are independent and have the same probability density 
function gy(y). 
It is interesting to note that 


EĻĒLY]] = E[EIZII. (5.83) 


This means that one can use E [Z] to estimate E[Y]. In the particular case where 


Y = X = h(X), (5.84) 
equation (5.82) becomes 
x 1 Š nrf 
E[Z] = — ———, 5.85 
= N* D gy (Vx) oem 
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and the relative error of the estimation is 
EIZ] — E[Z] 
E[Z] 


where ņ is the degree of certainty (i.e. 1 — 7 is the confidence level of the estimator). 
When N and N* are large, the precision of the estimator E[Z] (or E[Y ]) is given by 


ve[o@T eee. sm 


Prob(| 3 e) zi, (5.86) 


with 


-2 1 x 
O(x) = 7Tdu = zafe (=) ; (5.88) 


1 00 

zal f V2 
Thus, N* depends on Var(Z). To reduce N*, one needs to reduce Var(Z). The reduction ratio 

for this kind of precision is by definition 

Var(Z) 

Var(X) 


Solving the problem consists of determining gy(y) to reduce Var(Z). 


(5.89) 


Illustration 


To illustrate the procedure, let’s consider the following example. Consider V a standard 
Gaussian random variable. We would like to compute the following quantity using the Monte 
Carlo method: 


Prob(V > 6) = -*dv = Q(6). (5.90) 


1 [00] 
—— e 
~v 27 | 
This quantity is relatively small, i.e., approximately 107°. The classic Monte Carlo technique 
consists of generating N independent samples V; and to estimate Prob(V > 6): 


ProbV > 6) =~ ay (Ve) (5.91) 
TO > 6)= — e), ; 
N A 6\Yk 
where 
l, x>6; 
I6(x) = k ce. (5.92) 


2 
To obtain ņ = 0.05 and £ = 0.1, we have i = 20 and [o~ (298) ] = 3.84, hence 


2 


NSS (5.93) 
~~" 0.01 E[Z]2’ i 
0, x <6; 
X = I(x) = | ee (5.94) 
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It yields 
E[X]= Prob(V > 6) 
and 
Var(X) = E[X*] — E[X]? = Prob(V > 6) — E[X[’. 
Since E[X] = Prob(V > 6) ~ 107°, E[XF is negligible and (5.93) becomes 


y > 3:84 EIX] _ 384 


= = 3.84 x 10!!, 
= OO1EIXE E[X] a 


(5.95) 


(5.96) 


(5.97) 


We then observe that the number of simulations required to reach the desired quality 
is excessively prohibitive. To significantly reduce the number of simulations, we need to 
determine an appropriate probability density function gy(y), i.e., with gy(y), Y occurs relatively 
frequently over the zone where we need to have X > 6. One immediate possible choice 


(illustrated by Figure 5.1) is 


6) = ae 
= —e 2 
&y\y Jin 
Hence, using equation (5.80) we obtain: 
Z= xl, 
g(v) 
and since 
fœ) v2, w6)? 36 _6y 
-e 7" = 2e ; 
gv) 
it implies 


n(x) 


0 : g 
-5 0 5 10 


Figure 5.1 Probability density functions of the Normal distribution N(0, 1) and N(6, 


(5.98) 


(5.99) 


(5.100) 


(5.101) 


1) 
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where 


0, V <6; 


XV) = i oe. (5.102) 


We have already shown that E[Z] = E[X]. We need to compute the variance of Z, 
Var(Z) = E[Z*] — E[Z/’, (5.103) 


where 


E[Z?] = f 2(v) fy(v)dv 


doy 36 l 2 
= e e” —_~e Tdv 
6 ~ 20 


1 9: (vt12)? , 122 
___ 936 e 2 tid 


J 20 6 


1 f @+122 

36+72 — ey 

=e — e 2 dv. (5.104) 
Vax J6 


v 


Applying the following transformation of variables: v + 12 = u, we obtain 


E[Z*] = es f e" du 
~v 2m Jig 
= ¢!% Q(18). (5.105) 


When a is large, we can approximate Q (œ) by et. Finally, we have: 


1 _ 1s? 
e 2 
~ 2718 


—162+108 


E[Z7] = 108 


e 
— A2718 


Seis, (5.106) 


From the previous calculations, we obtain E [Z]?, which allows us to determine Var(Z). 
Using (5.87) (with n = 0.05 and £ = 0.1), we observe that the reduction in the number of 
simulations is impressive. The number of simulations needed is around 1000. 


5.5 APPLICATION CASES OF RANDOM 
VARIABLES SIMULATIONS 


5.5.1 Application Case: Generation of Random Variables as a Function of the 
Number of Simulations 


This application consists of writing a program to generate two Gaussian random variables 
(X1, X2) with the following moments: E[X,] = 0, E[X2] = 0, E[X7] =a’, E[X3] = b? and 
E[X| Xo] = c?. 
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Table 5.3 Impact of the number of simulations with simple simulations 


N E[X:] E[X{] E[X:] E[X3] E[X |X] p 
10 —0.089 0.087 —0.322 0.726 0.191 0.760 
100 0.054 0.176 0.042 0.671 0.284 0.826 
1000 —0.023 0.155 —0.051 0.622 0.235 0.757 
10000 0.004 0.158 0.011 0.637 0.247 0.779 
100000 0.001 0.160 0.003 0.639 0.250 0.782 


Exact values 0.000 0.160 0.000 0.640 0.250 0.781 


Simple Simulations 


Generate N = 10, 100, 1000, 10000 and 100000 pairs of variables with the following param- 
eters: a = 0.40, b = 0.80, c = 0.50. Evaluate their first and second order sample moments, 
as well as the empirical correlation between the two variables. Compare the sample statistics 
with the theoretical statistics for different values of N. What do you observe? 

The MATLAB program used to produce the simulation results given in Table 5.3 is the 
following: 


function [Moy, Cov] =NumberSim(a,b,c,NbTraj) 

$Function comparing the sample statistics as a function of 
$the number of simulations 

$a: Standard deviation of X1 

$b: Standard deviation of X2 

$c: Covariance of X1 and X2 

SNbTraj: Number of simulations 


MatrixCovariance = [a^î2,c^2;c^2,b^2]; 
MatrixL = chol (MatrixCovariance) '; 


X = MatrixLs«randn(2,NbTraj) ; 
Moy=mean (X,2); 
Cov=cov (X’); 


end 


where the MATLAB functions mean and cov give the mean vector and the covariance matrix 
of X. 

Table 5.3 shows that one needs a relatively large number of simulations to yield more 
accurate first and second order moments. 


Simulations with Quadratic Resampling Technique 


Use the quadratic resampling technique of Barraquand (1995) to generate N = 10, 100, 1000, 
10000 and 100000 pairs of variables with the following parameters: a = 0.40, b = 0.80, 
c = 0.50. Evaluate their first and second order sample moments, as well as the sample corre- 
lation between the two random variables. Compare the sample statistics with the theoretical 
statistics for different values of N. What do you observe? 
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Here is the MATLAB program for the generation of samples using the quadratic resampling 
technique. 


function Rep=ReQuadratic (X,MoyTheo, CovTheo) 

SFunction to perform the quadratic resampling 

$X: Sample matrix of the variables where the number of columns 

is equal to the number of simulations and the number of line is 
the dimension of the vector 

heo: Theoretical mean vector of the random vector 

heo: Theoretical covariance matrix of the random vector 

Rep: Sample matrix of the modified sample values 


Q 
O 
P 


dP dP W X ol? 
S 
O 
KX 


NbTraj = size(X,2); 


sDetermination of the sample parameters 
MoyEmp = mean(X,2); 

CovEmp = cov((X-repmat (MoyEmp,1,NbTraj))’) ; 
LEmp = chol(CovEmp) ’; 


LTheo = chol (CovTheo) ’; 


Rep = LTheoxinv (LEmp) + (X-repmat (MoyEmp,1,NbTraj) ) 
+ repmat (MoyTheo,1,NbTraj) ; 


end 


We can then use the quadratic resampling technique in the general program as follows 


function [Moy, Cov] =SimulationsWithRQ(a,b,c,NbTraj) 
$Function comparing the sample moments as a function of 
Sthe number of simulations 

Sa: Standard deviation of X1 

Sb: Standard deviation of X2 

$c: Covariance of X1 and X2 

$NbTraj: Number of simulations 


Theoretical parameters of the variables 
MoyTheo=[0;0] ; 

CovTheo= [a^î2,c^2;c^2,b^2]; 

L=chol (CovTheo) ’; 


%Simulation of the random variables 
Sample=randn (2,NbTraj) ; 
SampleSimple=repmat (MoyTheo,1,NbTraj)+L«Sample; 


sCalling the function ReQuadratic 
XRQ=ReQuadratic (SampleSimple,MoyTheo, CovTheo) ; 


Moy=mean (XRQ, 2); 
Cov=cov (XRQ’); 


end 
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Table 5.4 Impact of the number of simulations with the quadratic resampling 


N E[X:] E[X7] E[X2] E[X3] E[X |X] p 
10 0.000 0.160 0.000 0.640 0.250 0.781 
100 0.000 0.160 0.000 0.640 0.250 0.781 
1000 0.000 0.160 0.000 0.640 0.250 0.781 
10000 0.000 0.160 0.000 0.640 0.250 0.781 
100000 0.000 0.160 0.000 0.640 0.250 0.781 


Exact values 0.000 0.160 0.000 0.640 0.250 0.781 


Results are presented in Table 5.4. Contrary to the simple simulation case, the quadratic 
resampling technique gives the exact first and second order moments, i.e., the sample moments 
are equal to the theoretical moments regardless of the number of simulations. 


5.5.2 Application Case: Simulations and Improvement of the Simulations’ Quality 


Let’s consider two Gaussian random variables X; and X2 with means mı = 10 and m = 15, 
and variances o? = 0.20? and o = 0.50?, respectively. We assume the correlation between 
the two Gaussian random variables to be pọ = —0.30. 


Simulations with Antithetic Variables 


Use antithetic variables in your simulations to calculate the sample first order moments of 
the random variables Yı = e*! and Y> = e*?. Devoid of a variance reduction scheme, the 
MATLAB program used to simulate may be as follows 


function Moy=SimulationsWithoutTechnique 
$Function to calculate the mean of the variable Y1 without using 
$a simulation improving quality technique 


NbTraj=100; Number of simulations 
sigmal=0.2; Standard deviation of X1 
sigma2=0.5; sStandard deviation of X1 
rho=-0.3; %Correlation between X1 and X2 


$Theoretical mean and covariance of the variables X1 and X2. 
MoyTheo=[10;15]; 
CovTheo= [sigmal*2,rhox#sigmals«sigma2 ;rhoxsigmalssigma2,sigma2*2]; 


Sample=randn(2,NbTraj); Simulation of the Gaussian variables 
L=chol (CovTheo) ’; Cholesky factorization 


$Simulation of the variables X1 and X2 
SampleSimple=repmat (MoyTheo,1,NbTraj)+L«Sample; 


sTransformation of the variables X1 and X2 into Y1 and Y2 
Y1Simple=exp (SampleSimple(1,:))j; 
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Y2Simple=exp (SampleSimple(2,:)); 
Moy=mean (Y1Simple, 2) ; 


end 
The MATLAB simulation program with antithetic variables may be as follows: 


function Rep=SimulationsAnti 

$Function to calculate the mean of the variable Y1 using the 
Santithetic variables technique. 

SRep: Mean of the variable Y1. 


NbTraj=100; 
sigmal=0.2; 
sigma2=0.5; 
rho=-0.3; 


sTheoretical parameters 

MoyTheo=[10;15] ; 

CovTheo= [sigmal*2,rho«#sigmals«sigma2 ; rhoxsigmalsxsigma2,sigma2~2]; 
L=chol (CovTheo) ’; 


SSimulation of the random variables. 
Sample=randn (2,NbTraj) ; 
SampleSimple=repmat (MoyTheo,1,NbTraj)+L«Sample; 


SIntroduction of the antithetic variables 

XAnti=cat (2,SampleSimple,2x*repmat (MoyTheo,1,NbTraj)-... 
SampleSimple) ; 

sTransformation of Xi into Yi 

Y1Anti=exp (XAnti(1,:)); 

Y2Anti=exp (XAnti(2,:)); 


Rep=mean (Y1Anti,2) ; 


end 


Simulation results are presented in Table 5.5. 


Table 5.5 Simulations with antithetic variables 


Simple Antithetic 
NbTraj Estimation Error Estimation Error 
100 2.2097 E04 418 2.2424 E04 302 
1000 2.2264 E04 141 2.2482 E04 103 
10000 2.2450 E04 46 2.2482 E04 32 
100000 2.2456 E04 14 2.2469 E04 10 


1000000 2.2478 E04 5 2.2472 E04 3 
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We note that the antithetic variables not only improve the precision, but also considerably 
reduce the variance of the simulated values. 


Simulations with Control Variates 


Use the control variates in your simulations to obtain the sample first order moments of the 
random variables Y; = e*! and Y) = e*?. The implementation of this method is as follows. 
We use X; as a control variate for Y;. 


function Rep=SimulationsCont 
$Function to calculate the mean of the variable Y1 using the 
$control variates technique 


NbTraj=100; 
sigmal=0.2; 
sigma2=0.5; 
rho=-0.3; 


% Theoretical parameters 

MoyTheo=[10;15]; 

CovTheo= [sigma1l*2,rhox#sigmals«sigma2 ;rhoxsigmalssigma2,sigma2*2]; 
L=chol (CovTheo) ’; 


Simulation of random variables 
Sample=randn(2,NbTraj) ; 
SampleSimple=repmat (MoyTheo,1,NbTraj)+LsSample; 


$Transformation of Xi into Yi using the function Control given 
below 

% for the control variates 
Y1Cont=Control (exp (SampleSimple(1,:)),... 


SampleSimple(1,:),10); 
Y2Cont=Control (exp (SampleSimple(2,:)),... 
SampleSimple(2,:),15); 


Rep=mean (Y1Cont, 2) ; 


end 


function Rep=Control (A,B,MoyB) 

$Function using the control variates technique 
Sto adjust the simulated variables. 

SA: Simulated variables 

$B: Control variables 

S$MoyB: Theoretical mean of the control variables 
SRep: Modified variables 


$Covariance between A and B 
MatrixCovEmp = cov(A,B); 
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Table 5.6 Simulations with control variates 


Simple Control 
NbTraj Estimation Error Estimation Error 
100 2.2097 E04 418 2.2414 E04 48 
1000 2.2264 E04 141 2.2471 E04 20 
10000 2.2450 E04 46 2.2482 E04 7 
100000 2.2456 E04 14 2.2469 E04 2 
1000000 2.2478 E04 5 2.2472 E04 1 


$Calculation of the parameter alpha 
alpha = - MatrixCovEmp (1,2) /MatrixCovEmp (2,2) ; 


Rep = A + alphax(B - MoyB) ; 
end 


With these control variates, we obtain more precise results as shown in Table 5.6. 
An interesting exercise for the reader would be to combine the different variance reduction 
techniques with those that improve the precision (we may call this an hybrid approach). 


Notes and Complementary Readings 


For additional references, the reader can eventually consult the following classical books on 
Monte Carlo simulations and numerical methods: Press, Teukolsky, Vetterling and Flannery 
(1992), and Rubinstein (1981). 

For complementary readings on variance reduction techniques and simulation quality im- 
provement methods, see Dupire (1998), Glasserman (2003), Jackel (2002), Kloeden, Platen 
and Schurz (1997), Lapeyre, Pardoux and Sentis (1998). 
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Fundamentals of Quasi Monte Carlo 
(QMC) Simulations 


In the previous chapter, we introduced the Monte Carlo simulation techniques and presented 
variance reduction and precision improving schemes. We also saw that the Monte Carlo 
method allows us to estimate the mathematical expectation of a random quantity by generating 
a number of statistically independent and identically distributed variables. 

Monte Carlo simulation is a reasonable choice when we can’t analytically estimate this 
mathematical expectation. We shouldn’t forget that the mathematical expectation is represented 
by a multi-dimensional integral (in the case of random vectors). Viewed in this perspective, 
the Monte Carlo method is an interesting alternative when we want to numerically evaluate 
complex integrals. 

This method has a weakness stemming from the random nature of the results. Because of this, 
not long after the introduction of the Monte Carlo method at the end of the 1940s, researchers 
were interested in the possibility of replacing the random samples in the computations with 
deterministic series. This corresponds to the fundamental idea of the Quasi Monte Carlo 
method. 

These sequences are known as low discrepancy sequences. As we saw previously, we can 
always find transformation of variables to transform the integration domain to a hypercube. 
Building a low discrepancy sequence consists of choosing the points (or sequences) inside the 
hypercube in order to assure good numerical results for the integration. 

The concept of low discrepancy corresponds to the gap between the sample originating from 
a deterministic sequence and that obtained from a uniform random variable. The sequence has 
the property that, for all integers n, the first n elements of the sequence are almost uniformly 
distributed on [0, 1]. The definition of “almost uniformly distributed” may vary depending on 
the measure used to define the difference between two sequences. 

In this chapter, we present different algorithms used to generate low discrepancy sequences 
(LDS). The Van Der Corput sequence is the simplest one and serves as a base in building 
other sequences used in finance. The best known sequences are those of Halton, Faure and 
Sobol. The building principle of these sequences consists of dividing the unit hypercube into 
smaller hypercubes whose faces are parallel to the faces of the unit hypercube. One point of 
the sequence is placed in each small hypercube. Once all the hypercubes contain one point, 
then the unit hypercube is subdivided into smaller ones as new points are distributed in the 
empty spaces. 

The dimension of a sequence d is usually defined as the number of time steps + or the 
number of underlying assets depending on the problem to be solved. The biggest challenge for 
having a good low discrepancy sequence is to avoid the clustering of points in a small region. 
This phenomenon of finding many points in a small region is often found in standard Monte 
Carlo simulations and is the result of correlations between the dimensions of a sequence, 
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that is the correlation between two instants ¢ and s or the one of two underlying asset values 
simulated. This correlation is described later in the chapter. 

At the end of the chapter, we provide a comparative overview of the different sequences. 
Another application of the Quasi Monte Carlo method is presented in Chapter 15. However, it 
is important to note that the efficiency of this method is not always guaranteed; it depends on 
the problem we are studying. 


6.1 VAN DER CORPUT SEQUENCE (BASIC SEQUENCE) 


The Van Der Corput sequence is the basic sequence of many Quasi Monte Carlo simulations 
(QMC). We now present the method to follow in order to determine the n element of the 
sequence. We begin by choosing a prime number b. To find the n number, we simply have to 
perform the following operations: 


1. Write n in base b; this allows us to find a; (n) such that: 


m 


n=} aj(n)bi, (6.1) 
j=0 


where m is the smallest integer such that a; (n) = 0 for all j > m. 
2. “Reverse” the number n to the decimal point (see example) to find the value of the n” 
element which we denote by b,: 


by = n) = a (6.2) 
j=0 


In reversing the number, we make sure that the value lies in the interval (0, 1). For example, 
if n = 1140, or equivalently 1023, then 3, = 19/27 which equals 0.2013. 


If the chosen base is 2, then we have the following sequence: 
0, 1/2, 1/4, 3/4, 1/8, 5/8, 3/8, 7/8, 1/16, 9/16, 3/16, 11/16, 5/16, 13/16,... (6.3) 
However, in the case of base 3, we have: 
0, 1/3, 2/3, 1/9, 4/9, 7/9, 2/9, 5/9, 8/9, 1/27, 10/27, 19/27, 2/27, ... (6.4) 


The further we look into the sequence, the closer to 1 the points will be (without ever getting 


exactly 1). We present next a MATLAB program that allows us to find the n element of the 
sequence in base b. 


function rep=VanDerCorput (n,b) 

SFunction that computes the elements of the Van Der Corput sequence 
$n: Beginning point to generate the sequence 

Sb: base of the sequence 

sThe n th element in base b 

bn=0; 

$j] represents the powers of b in the decomposition of n 

j=0; 
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while n~=0 
bn=bn + mod(n,b) /b* (j+1); 
n=floor(n/b) ; 
j=j+1; 

end 


rep=bn; 
end 


6.2 HALTON SEQUENCE 


The Halton sequence is a multi-dimensional extension of the Van Der Corput sequence. This 
sequence is easily derived from the Van Der Corput method. To build the Halton sequence, we 
use the points in the sequence of the Van Der Corput but change the base for each dimension. 
We use base 2 for the first dimension, base 3 for the second dimension, base 5 for the third 
dimension, etc. In fact, the x” dimension will be the Van Der Corput sequence obtained from 
the x” prime number. In Table 6.1, we show the first elements of the Halton sequence in 
dimensions 1, 2, 3 and 4. 

If we use MATLAB’s random number generator to generate 1000 couples in the unit square, 
then we obtain Figure 6.1. 

In the Halton sequence case, we get Figure 6.2 for dimensions | and 2 when we have 1000 
points. 

We can see from Figure 6.2 that the points are more evenly distributed and scattered 
in the unit square. With MATLAB’s random generator, the sequence contains many points 
condensed and clustered in the same area. The problem with the Halton sequence occurs 
with higher dimensions. Too many points are needed to cover the entire unit hypercube. For 
example, the points for the 27” and 28” dimensions are represented in Figure 6.3. 

To solve this problem, we will permute some Van Der Corput sequences in order to disperse 
our sample. 


Table 6.1 First terms of Halton sequence 


Term Dimension 1 Dimension 2 Dimension 3 Dimension 4 


base 2 base 3 base 5 base 7 
1 1/2 1/3 1/5 1/7 
2, 1/4 2/3 2/5 2/7 
3 3/4 1/9 3/5 3/7 
4 1/8 4/9 4/5 4/7 
5 5/8 7/9 1/25 5/7 
6 3/8 2/9 6/25 6/7 
7 7/8 5/9 11/25 1/49 
8 1/16 8/9 16/25 8/49 
9 3/16 1/27 21/25 15/49 
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Figure 6.1 MATLAB’s random generator 
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Figure 6.2 1000 elements of the Halton sequence in dimensions 1 and 2 
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Figure 6.3 1000 terms of the Halton sequence for dimensions 27 and 28 
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6.3 FAURE SEQUENCE 
The Faure sequence looks like the Halton one, with two major differences: 


e We choose the same base for all of our problem’s dimensions, 
e We use a permutation of the vector composed of the elements for each dimension. 


For a d-dimensional simulation, we take Van Der Corput sequence in base b, where b is the 
smallest prime number such that b > d and we permute the terms. The algorithm is given by: 
1. n= Sane! , 

2. b! = 57, 20 


i=0 pH > 
k j! k—1 
3. ai (n) = Po Wor (n) mod(b), 
k 
k m a;(n) 
4. by =a Dio ptt: 


We can further simplify the calculations by using matrix notation. The previously presented 
algorithm can be rewritten as: 


aj(n) gO @ @ @ ay (n) 

aj (n) One) are ame Ey) ay ‘(n) 

aim}—| 0 0 (3) O ay '(n) |, (6.5) 
a3(n) 0 0 0 


as '(n) 


In MATLAB this algorithm can be implemented as follows 


function Faure (nb, Dimi, b) 

%šFunction that generates Faure sequence elements 
snb: Number of elements to generate 

$Dim1l: First dimension of the random points 

sb: base 


bn=zeros (nb, 2); 


for l=1:nb 
sWe decompose 1 in base b 
a=decompose(1,b) ; 
m=size(a,1); 


sTransition matrix between dimensions 
MatTrans=zeros(m,m) ; 
for j=1:m 
for i=j:m 
MatTrans (j,i) =nchoosek (i-1,j-1); 
end 
end 


$Computation of vector a and for next iteration 
a=mod (MatTrans~* (Dim1-1)*a,b) ; 
aplusl=mod (MatTrans*a,b) ; 
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$Computations of points bn 
for f=1:m 
bn(1,1)=bn(1,1)+a(£,1)/b* (£) ; 
bn (1,2) =bn(1,2)+aplus1(f£,1)/b* (£); 
end 
end 


sWe plot the points we got 
figure 
plotbn'(: yt) , brits ,2) 4° 2") 
end 


$Function that decomposes a number in the given base 
function rep=decompose (n,b) 

sn: Number to decompose 

sb: base of the decomposition 


temp=1; 

while n~=0 
rep (temp,1)=mod(n,b) ; 
n=floor(n/b) ; 
temp=temp+1; 

end 

end 


Using this program, we obtain the random samples plotted in Figures 6.4 and 6.5. 

Similar to the Halton case, Faure sequence’s speed decreases when the dimension increases. 
However, this feature is less problematic than the Halton sequence since the prime numbers 
used are smaller. For example, when working in dimension 50 the Faure sequence uses the 
prime number 53, the smallest prime number greater than 50, while the Halton sequence uses 
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Figure 6.4 1000 terms of Faure sequence in dimensions 1 and 2 
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Dimension 29 (base 31) 
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Figure 6.5 1000 terms of Faure sequence in dimensions 28 and 29 


prime number 229 which is the 50” prime number. Faure’s approach links the theory of low 
discrepancy sequence with the combinatorial theory of vector permutations. Faure sequence’s 
speed decreases starting from the 25” dimension. Many people consider the Faure sequence to 
be significantly more time consuming than the Halton sequence, or the Sobol sequence which 
is described in the next section. 


6.4 SOBOL SEQUENCE 


The Sobol sequence is built from the Van Der Corput sequence by alternating the elements 
for each dimension. We still have the same elements but we permute the order. We present 
the algorithm and its implementation for obtaining the Sobol sequence. Suppose that we want 
to generate the n + 1” term of the sequence for dimension j denoted by /,,;. We begin the 
sequence with x, = 0.5 in base ten or equivalently 0.1 in binary. We begin by writing n in 
binary. 


N = 82 fy Gl ee hi, (6.6) 


We call k the subscript i of the first coefficient n; ~ 0 in the binary representation of n. For 
example, if nı = 1 (nı Æ 0) then k = 1, if nı = 0 and m = 1 (m Æ 0) then we take k = 2,... 
Finally, we only need to compute element x,,,; in the following way: 


Xn+1 = Xn ® Vk, (6.7) 


where vz is the k direction for dimension j (keep in mind that all the calculations are only 
for dimension j). The v, are deterministic numbers. The symbol ® represents the exclusive 
binary operator “OR”. The elements v are used to permute the Van Der Corput sequence 
terms. They are generated by recurrence: 


vi = A, Vj-1 ® A2Vj-2 ® ... D Ag—1Vi-g+1 D Vi-g ® (Vi-g/2"), (6.8) 


where this time, 7 > q and aq; are the coefficients (0 or 1) of a degree q primitive polynomial 
of the form P(x) = x4 + ax! + axx? + ... + ag-ix! + 1. We associate a primitive 
polynomial with each dimension in order to permute the sequence. Our aim with this book is 
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Table 6.2 Primitive polynomials for first dimensions 


Degree Primitive polynomials modulo 2 
1 x+1 
2 xX +x+1 
3 XL+x+ land +x +1 
4 X +x+ lanx +x +1 
5 DHELE 4+ P41Pt¢P4¢P4txe41wrt¢xetgPtut lL HaHa Hl 


and Ý +47 +EH 
6 MO4+xt HAHH Hl, E H Hl EH HaHaHa H He 42°41 
and xf + +x4+%4+1 


not to go over the theory of primitive polynomials or the algebraic theory so we are presenting 
only some of them here. We fill Table 6.2 with primitive polynomials up to degree 6. 

Very often it is more convenient to modify the v; and work with the values M;. The values 
of M; are defined by: 


Mi = 2! Vi (6.9) 
and thus satisfy the recurrence 
Mi = 2a,M;-| ($>) 2 arMi_> @...@ 29 1ag_-1Mi-g+1 ($> 21 Miq ($> Miq. (6.10) 


It follows from the properties of v; that M; is an integer odd number between 0 and 2’. In order 
to get the values of M; by recurrence, we need the starting points for each dimension. 

Table 6.3 presents some examples of the terms used for the implementation. We must remark 
that the number of initial values depends on the dimension of our problem since for higher 
dimensions M; depends on Mj_1, Mj-2, Mj-3.... 

In order to simplify the algorithm, instead of generating values of x;, we will re-scale the 
values to generate y; which will be a multiple of x;. Suppose we want to generate x,,,. Set 
My+1 to be the smallest power of 2 such that 


241 > n+l. (6.11) 
We can show that 


mnsi = [log(n + 1)/log(2)]. (6.12) 


Table 6.3 M,’s initial values 


Degree Polynomial Initial values 


BRRWWNR 
RENFRO 
Ree RRP RR 
Re RW We 
MmWwwnr 
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By setting y,4; = 2”""*'x,41, we can compute 


Yat = 2 Xn 41 
= 2™ (Xn ® Vg) 
= 2" (x, B M/Ž) 
= 2" x, 2" M; 
_ J yn ® Mnk M, if mn = Mn41 


E | 2yn © 2 -*M, ifnot oe 


Let us now present an algorithm to generate the Sobol sequence. This algorithm borrowed 


from Numerical Recipes in C by Press et al. (1992) will be programmed in MATLAB. QMC 
techniques rely heavily on number theory, which can be too technical. For practical purposes, 
we provide here only a general description of the method. 


=. 


W 


eN 


. Select a primitive polynomial for dimension j. 
. Randomly choose the starting values for the first terms M; (those we need to begin the 


recurrence). 


. Determine position k of the rightmost zero in the decomposition of n in base 2. 
. Compute a new M; by recurrence if needed with the chosen polynomial in step 1 and the 


starting values of M. 


. Compute the value of y„+ı with the formula shown above. 
. Compute x,,) from y,4) USING X41. = Yn41/ 2H. 
. Return to step 3 to generate the following elements of the sequence. 


Example 4.1 We begin with the polynomial 
P&)=x°+x+1 (6.14) 


and with initial values Mı = 1, M2 = 1, yı = 1. It follows that xı = 1/2. For the second 
term, we have k = 2 since lpasel0 = lpase2 = 1 * 2° + 0 * 2! and m = 2 since 2? > 3 and 
2! < 3. Thus 
y2 = 2y, © M2 
= 10) 6 2°1, 
= 1l 
= 310 
and x» = yo/2? = 0.75. For the third term, k = 1 since 2pase10 = Olpase2 = O * 2° + 1 *2!, 
m3 = 2 since 2? > 4 and 2! < 4, 
y3 =y B27 ' Mı 
= 11,610) 
= 0l; 
= lio 
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and x3 = y3/2? = 0.25. For the fourth term, k = 3 since 3pase10 = 110 pase2 = 1 * 2° + 1 * 
2! + 0 * 27, m4 = 3 since 2? > 8 and 2? < 8. We find M3 = 7 and 
y4 = 2y3 ® 2°M3 
= 10. 6111, 
= 101, 


= 510 


and x4 = y4/2? = 0.6250. 


Here is a MATLAB program to generate the Sobol sequence. 


function rep=SobolShuffle (Nb) 
SFunction that generates points of Sobol sequence 
SNb: Number of points to simulate 
Polynomial=1; %Choice of a polynomial 
Degree=2; Degree of chosen polynomial 
M=[1;1]; %Initial values of the M_i’s 
SFirst step 
m=1; 
y(1)=1; 
x (1) =y (1) /2°m; 
$Principal loop for elements 2 and above 
for n=2:Nb 

Position=Pos(n-1); 

sAdd an element to vector M 

if Position>size(M) 

M=AddM (M, Polynomial, Degree) ; 

end 

m=ceil (log(n+1)/log(2)); 

y (n) =bitxor (2^ (m==Position) *y(n-1),... 

2* (m-Position) *M(Position) ); 

x(n) =y(n) /2%m; 
end 
rep=x; 
end 


$Function determining position of rightmost 
$zero in the decomposition in base 2 of k 
function rep=Pos (k) 
sk: Number to decompose 
PosTemp=1; 
while bitand(k,1)==1 

PosTemp=PosTemp+1 ; 

k=bitshift (k,-1); 
end 
rep=PosTemp; 
end 
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$Function that computes a new value M 

function rep=AddM (M, Polynomial, Degree) 

$M: actual vector 

$Polynomial: Primitive polynomial 

sDegree: Degree of the chosen polynomial (from table) 

Length=size(M,1); 

Temp=bitxor (2^ (Degree) *M(Length+1-Degree),... 

M(Length+1-Degree) ) ; 

i=Degree-1; 

while Polynomial~=0 
Temp=bitxor (Temp, 2^ (i) *mod(Polynomial,2) *M(Length+1-i) ); 
Polynomial=bitshift (Polynomial, -1) ; 


i=i-1; 
end 
rep=[M;Temp] ; 
end 


This sequence has been used as a benchmark for many efficiency tests for QMC deterministic 
methods. 


6.5 LATIN HYPERCUBE SAMPLING 


This method is different from the previous ones since it is no longer deterministic but random. 
In fact, we proceed as follows: 


1. Divide each dimension of our space in N sections. 
2. Generate a uniform random variable and randomly distribute it anywhere in the first section. 
3. We repeat the previous steps to distribute the values in all the sections of our problem. 


We present here the MATLAB program that follows these steps. In this program, we set N to 
be the number of points to simulate while k is the dimension of the sample: 


function rep = LHS(N,k) 

$Function using the latin hypercube method 
SN: Number of points to generate 

$k: Dimension of the sample 


$Generation of the random numbers matrix 
Matrix = rand(N,k); 
$Loop to generate the permutations and mixing the terms 


for j=1:k 

per = randperm(N) ; 

Matrix(:,j) = (per’-1+Matrix(:,j))/N; 
end 


rep = Matrix; 


end 
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Dimension 2 


0 0.2 0.4 0.6 0.8 1 
Dimension 1 


Figure 6.6 1000 points of the Latin Hypercube in dimensions 1 and 2 


Dimension 30 


0 0.2 0.4 0.6 0.8 1 
Dimension 29 


Figure 6.7 1000 points of the Latin Hypercube in dimensions 29 and 30 


We can see from Figures 6.6 and 6.7 that the points are as well distributed in dimension 1 
as in superior dimensions. 


6.6 COMPARISON OF THE DIFFERENT SEQUENCES 


In this section we use the Quasi Monte Carlo methods to solve an elementary problem. 


Example 6.1 Suppose that we want to evaluate the volume defined below the surface 
yz + x? = 1 in the unit cube [0, 1] x [0, 1] x [0, 1] in R?. In order to find this value, 
we need to generate triplets (a, b, c) in the unit cube and verify that relation bc + a? < 
1 holds. Then we find the fraction of triplets that satisfy the relation. This will give us an 
approximative value of the volume, that is an integral. We used the different QMC methods 


to generate the triplets and present the results in Table 6.4. We compare the results with 
MATLAB’s uniform random number generator. 
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Table 6.4 Comparison of QMC methods 
N MATLAB Halton Faure Sobol Hypercube 
1000 0.8490 0.8610 0.8540 0.8550 0.8400 
2000 0.8605 0.8600 0.8500 0.8540 0.8470 
3000 0.8657 0.8587 0.8507 0.8537 0.8587 
4000 0.8400 0.8570 0.8515 0.8538 0.8548 
5000 0.8462 0.8568 0.8512 0.8556 0.8508 
6000 0.8558 0.8567 0.8522 0.8547 0.8553 
7000 0.8481 0.8564 0.8536 0.8540 0.8570 
8000 0.8526 0.8560 0.8535 0.8538 0.8519 
9000 0.8560 0.8549 0.8537 0.8543 0.8549 

10000 0.8574 0.8546 0.8537 0.8541 0.8542 


We present below the program allowing us to estimate the integral using the different 
sequences. Since the principal functions used to generate QMC sequences have been built 


in previous sections, we use them to generate the points. 


function CompareQMC (NbTraj ) 
$Function that computes the volume under surface 


S$yZt+xX 2=1. 


$NbTraj: Number of points to generate. 


a=rand(NbTraj,1); 
b=rand(NbTraj,1); 
c=rand(NbTraj,1); 


disp(‘Evaluation with MATLAB’); 


mean (b.*c+a.*a<1) 
a=Halton (NbTraj, 3) 
b=Halton (NbTraj,5) 
c=Halton (NbTraj,7) 


1 
r 


' 


disp (‘Evaluation with Halton sequence'); 


mean (b.*c+a.*ža<1) 


a=Faure (NbTraj,1,3); 
b=Faure (NbTraj,2,3); 
c=Faure (NbTraj,3,3); 
disp (‘Evaluation with Faure sequence'); 


mean (b.*c+a.*a<1) 


a=SobolShuffle(NbTraj,1,2,[1;1]); 
b=SobolShuffle(NbTraj,1,3,[1;3;7]); 
c=SobolShuffle(NbTraj,2,3,[1;3;3]); 
disp(‘Evaluation with Sobol sequence’ ) ; 
mean (b.*c+a.*a<1) 

Temp=LHS (NbTraj,3) ; 


a=Temp(:,1); 
b=Temp(:,2) ; 
c=Temp(:,3) ; 


disp(‘Evaluation with latin hypercube sequence’) ; 
mean (b.*c+a.*a<1) 
end 
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$Functions that generates Halton sequence elements 
function rep=Halton(nb, b1) 
rep=zeros (nb,1) ; 
for j=1:nb 
rep (j,1)=VanDerCorput (j,b1); 
end 
end 


SFunction that generates elements of Faure sequence 
function rep=Faure(nb, Dimi, b) 
bn=zeros (nb,1); 
for l=1:nb 
a=decompose (1,b) ; 
m=size(a,1); 
MatTrans=zeros (m,m); 
for j=1:m 
for i=j:m 
MatTrans (j,1)=nchoosek(i-1,j-1); 


end 
end 
a=mod (MatTrans~* (Dim1-1) *a,b) ; 
for f=1:m 
bn(1,1)=bn(1,1)+a(£,1) /b* (£); 
end 
end 
rep=bn; 
function rep=decompose (n,b) 
temp=1; 


while n~=0 
rep (temp, 1)=mod(n,b) ; 
n=floor(n/b) ; 
temp=temp+4+1; 
end 
end 
end 


Function that generates elements of Sobol sequence 
function rep=SobolShuffle (Nb, Polynomial, Degree,M) 
$First step 
m=1; 
y(1)=1; 
x=zeros (Nb,1) ; 
x(1,1)=sy(1)/2°m; 
Principal loop 
for n=2:Nb 
Position=Pos(n-1); 
if Position>size (M) 
M=AddM (M, Polynomial, Degree) ; 


end 
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m=ceil (log(n+1)/log(2)); 
y (n) =bitxor (2* (m==Position) *y(n-1) ,2* (m-Position) *M 
(Position) ); 
x(n,1)=y(n) /2%m; 
end 
rep=x; 
sFunction that finds the position of the 
Srightmost zero in the decomposition 
sof k in base two 
function rep=Pos (k) 
PosTemp=1; 
while bitand(k,1)==1 
PosTemp=PosTemp+1 ; 
k=bitshift (k,-1); 
end 


rep=PosTemp; 
end 
sFunction that computes a new value for M 
function rep=AddM (M, Polynomial, Degree) 
Length=size(M,1); 
Temp=bitxor (2^ (Degree) *M(Length+1-Degree) , 
M(Length+1-Degree) ) ; 
i=Degree-1; 
while Polynomial~=0 
Temp=bitxor (Temp, 2^ (i) *mod (Polynomial, 2) *M 
(Length+1-1i)) ; 
Polynomial=bitshift (Polynomial, -1) ; 
i=i-1; 
end 
rep=[M;Temp] ; 
end 
end 


S$Function that generates element of latin hypercube sequence 
function rep = LHS(N,k) 
Matrix = rand(N,k); 


for j=1:k 

per = randperm(N) ; 

Matrix(:,j) = (per’-1+Matrix(:,j))/N; 
end 


rep = Matrix; 
end 


The volume estimated by MATLAB is close to 0.854. In this simple problem, the results 
using the Halton, Faure and Sobol methods converge faster than those with MATLAB’s 
generator. However, we must remember that this problem is in dimension 3 only. With 
higher dimensions, the Halton and Faure sequences lose their effectiveness much faster 
than the Sobol sequence. For the Latin hypercube method, regardless of the dimension, 
we do not note any loss of efficiency. The hypercube method only redistributes “more 
uniformly” the values generated by MATLAB. 
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Example 6.2 We are still seeking to compare QMC sequences with MATLAB random 
number generator but this time we want to compute the expectation 


E[max(S; — 1.0, S2 — 0.8, $3 — 1.3, 0)], (6.15) 


where S; = e” with Z; ~ N(0, 1) (the dimension of this problem is still 3). For this sim- 
ulation, we transform uniform random variables obtained from the sequences into normal 
random variables. We present the MATLAB code using Halton sequence in dimension 3 
to generate the results. For the other sequences, the adaptations are similar. 


function ComputeExpectation (NbTraj ) 
SFunction that computes the expectation using Halton sequence 
$in dimension 3 
$NbTraj: Number of points to generate 
sTransformation of the uniform variables into normal variables 
S=norminv ( [Faure (NbTraj,2,3),Faure(NbTraj,3,5),... 
Faure (NbTraj,4,5)]); 

sComputation of cash flows given by the function maximum 
Payoff=zeros (NbTraj,1); 
for 1=1:NbTraj 

temp=max (max (exp(S(1,1))-1,0),max(exp(S(1,2))-0.8,0)); 

Payoff (1) =max(temp,max(exp(S(1,3))-1.3,0)); 


end 

disp(‘Mean’); 

disp (mean (Payoff) ) ; 
disp(‘Standard error’); 

disp (std (Payoff) /sqrt (NbTraj) ) ; 
end 


sFunction that generates elements of Faure sequence 
function rep=Faure(nb, Dimi, b) 
bn=zeros (nb,1); 
for l=1:nb 

a=decompose(1,b); 

m=size(a,1); 

MatTrans=zeros (m,m); 

for j=1:m 

for i=j:m 
MatTrans (j,i) =nchoosek (i-1,j-1); 


end 
end 
a=mod (MatTrans*Diml1*a,b) ; 
For-f£=13m 
bn(1,1)=bn(1,1)+a(f£,1)/b* (£); 
end 
end 
rep=bn(:,1); 
end 


sDecomposition of n in base b 
function rep=decompose (n, b) 
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temp=1; 

while n~=0 
rep (temp, 1)=mod(n,b) ; 
n=floor(n/b) ; 
temp=temp+1; 

end 

end 


We show the results in Table 6.5 where the numbers in parenthesis represent the standard 
error (standard deviation divided by the square root of the number of simulations). It is 
interesting to note a slightly smaller variation in the results when using Quasi Monte Carlo 
methods. 


Table 6.5 Comparison of the sequences for an option-like problem 


N MATLAB Halton Faure Sobol Hypercube 


200 2.10 1.87 2.01 1.94 2.15 
(0.19) (0.14) (0.16) (0.16) (0.20) 
400 1.96 1.98 2.02 2.03 2.14 
(0.12) (0.12) (0.12) (0.12) (0.14) 
600 2.05 2.00 2.09 2.07 2.12 
(0.13) (0.10) (0.11) (0.11) (0.12) 
800 2.03 2.04 2.08 2.10 Del} 
(0.10) (0.09) (0.09) (0.09) (0.10) 
1000 2.16 2.05 2.08 2.11 2.15 
(0.10) (0.08) (0.08) (0.09) (0.09) 


As indicated in the introduction, a program using QMC simulations to compute the value at 
risk of a portfolio of bonds is presented later in Chapter 15. 


Notes and Complementary Readings 


For a complementary review of literature on Quasi Monte Carlo simulations, the reader can 
consult the books of Glasserman (2003) and Niederreiter (1992), and articles by Joy, Boyle 
and Tan (1996), Galanti and Jung (1997), Papageorgiou and Paskov (1999) and Tan and Boyle 
(2000). 
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Introduction to Random Processes 


When we observe, for instance, the variations of stock prices, interest rates, currencies and 
commodities, we reckon that the fluctuations cannot be precisely predicted at a future date. 
These variables evolve in a random way. We qualify these processes as random processes. 

A random process, which we denote by X, can be heuristically considered as a function of 
time such that at each moment the value X(t) is a random variable. 

From a mathematical standpoint, a random vector is a function of the space of events in a 
space containing the functions called realizations of the process. Conceptually, a process is 
composed of the set of all possible functions that can be observed. In order to characterize it, 
we will use probabilistic concepts already introduced previously in this book. 

In this chapter, first we define and characterize random processes. Then we treat random 
processes that are widely used in finance, such as the Gaussian or Wiener processes as well as 
Brownian bridges. These processes are very useful to characterize those processes describing 
financial assets values. Numerical solutions of the stochastic differential equations constructed 
from Wiener processes are presented in a more detailed manner in the next chapter. 


7.1 CHARACTERIZATION 
7.1.1 Statistics 


Since a random process is a function of time, the value X at time ¢ is therefore a random 
variable. To characterize it statistically, we need to know its probability density function 


fx, t). 
We write the argument ¢ to point out that this function depends on the time ¢ at which we 
examine the process X. This density known, we can deduce the mean of X(t), my(t), as follows: 


+00 
myx(t) = E [X(t)] = f xfx(x, t)dx. (7.1) 
—00 
If this mean is a function of time, then we can obtain the variance of X(t) as: 
+00 
2 
ox(t) = E [XO —mx)"] = f (x —mx(O) fx, Dax. (7.2) 
—00 
Physically, my(t) represents the mean variation of the process and o? (t) measures the fluctua- 
tion of the process around its mean trajectory my(t). 

If we are now interested in the value taken at two different times, X(t,) and X(t2), we have 
to analyze two random variables. In other words, we are dealing with a 2-dimensional random 
vector. We write X; = X(t,) and X2 = X(t). 

This vector (X1, X2) is fully characterized if we know its joint probability density function 


fxi. x:(%1, X2, ti, t2). (1.3) 
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It is natural to posit that this probability density function depends simultaneously on ¢, and t2 
for which we examine the values of the process. As we have seen in the previous chapter, this 
joint density contains all the information on the marginal densities 


+00 
Fait) =f fayxrienjsti tidy with i,j € {1,2}. (1.4) 


=Q 


Thus, this joint density allows us to compute the mean and variance of the process. Moreover, 
we can compute the joint moment: 


+00 +00 


E [X,X2] = f J kfina n dnd (7.5) 


—00 — 00 
This joint moment denoted by Rx x (tı, t2) is called the autocorrelation function of the process: 
Rxx(ti, h) = E[X(4)X(h)]. (7.6) 


It simultaneously depends on the 2 time values f; and tz. Centered joint moments are defined 
as follows: 


Cxx(t, b) = E KX 0) — mx t) (X6) — mx())]- (7.7) 


We call it the auto-covariance function of the process. 
Now, by considering any n points in time t1, t2,. . ., tn, we can define the joint probability 
density function 


Fx,,.X) grees X, (X15 X25 0665 Xny thy fy «++ 5 tn): (7.8) 


The process is said to be perfectly characterized if, for any n and for any set (t1, f2,..., tn), the 
joint density of order n is completely defined. 

Note that it is generally impossible to characterize a random process. Fortunately, problems 
of interest in finance do not require this perfect characterization. Instead, we only need to 
characterize the first and second order moments. Another way to avoid this problem is to 
describe how the process is going to be generated. 


7.1.2 Stationarity 


In many cases, when we look at a random process’s variations, we note that the process 
fluctuates in a jagged way as illustrated in Figure 7.1. 

Intuitively, we recognize a certain invariant character with respect to time. In fact, this 
random process belongs to the stationary processes (strict stationarity). This class of processes 
is characterized by the fact that the probability density functions of any order 


FX 1X0 R A (X15 X25 60-5 Xn th, ta, -.-, tn) (7.9) 


are independent of time origin. In practice, such a constraint seems strong. Moreover, in the 
following chapters we will usually only use the first order density fx,(%1, tı) and the second 
order density fx,.x,(%1, X2, t1, t2). 
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Figure 7.1 S&P 500 daily returns for year 2004 


This brings us to the definition of a second order stationary processes class. It is defined as 
the set of all processes such that their first order density function fy, (x, tı) and their second 
order one fx,,x,(%1, X2, t1, t2) don’t depend on the time origin. 

Hence, for a stationary process, we have: 


fx, t) = fx@) (7.10) 


and 


fxi x1, X2, t1, t2) = fxi, Xx: X1, X2, b2 — t1). (7.11) 


Which implies that my(t) is a constant and 
Rxx(t, t2) = Rxx(t2 — tı). (7.12) 


Therefore, for the second order stationary processes, the mean is constant and the autocorre- 
lation function depends only on the differences in the instants for which it is defined. 


7.1.3  Ergodicity 


To compile these statistics in a perfect way in practice, we need to have an infinity of process 
realizations. Instead, we only have one realization of the process as a function of time. Based 
on the statistics of the process, if we imagine that this function evolves from —oo to +00, this 
is tantamount to saying that, using these variations from —oo to +00, we can compile all of 
our process’s statistics. Processes having this property are said to be ergodic. 

Since the ergodicity concept is not used often in this book, we do not dwell on it. 


7.2 NOTION OF CONTINUITY, DIFFERENTIABILITY 
AND INTEGRABILITY 


Since a random process is an infinite set of trajectories (abstract representation), concepts 
of continuity, differentiability and integrability for a process must be defined in terms of 
convergence criteria. 
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Obviously, as seen in the previous chapters, many convergence criteria can be used. Since 
our focus is on Monte Carlo simulations, we adopt the quadratic mean convergence criterion. 
This criterion will be used to define the continuity, differentiability and integrability of random 
processes. 


7.2.1 Continuity 
A random process X(t) with finite variance converges in quadratic mean to time point fo if 


E[(X() — Xt] 0: (7.13) 


If X(t) is continuous in quadratic mean for all tọ € [a, b], we say that X(t) is continuous in 
quadratic mean on this interval. Continuity in quadratic mean is characterized by the following 
theorem. 


Theorem 2.1 X(®) is said to be continuous in quadratic mean on interval [a, b] if and only 
if Rx x(t, t2) is continuous on the diagonal a < t; = tz < b. 


Proof: That is: 
E [(X() — X())] = E [X0] + E [Xt] — 2Rxx(t, to). (7.14) 
If X(t) is continuous in quadratic mean at fo, then 
lim Rxx(©, fo) = E [Xto]. (7.15) 
Conversely, if Rxx (t, to) is continuous on the diagonal t = fo then 


lim Rxx(t, to) = Rxx (to, to) = E [X 0]. (1.16) 


7.2.2 Differentiability 


The concept of differentiability in quadratic mean is defined in a similar manner as for the 
continuity. Let X(t) be a random process, we say that X(t) is differentiable in quadratic mean 
at time tọ if there exists a process Y(t) such that: 


PA 
lim e| (oe = ra) l =0. (7.17) 


t> 10 t — to 


Since Y (to) is not known, we can use the Cauchy criterion as follows: 


2 
ioe (Sette oS) | a ea 


tn >0,tn 0 th tm 


If X(t) can be differentiated in quadratic mean for allt € [a, b], we say that X(t) is differentiable 
in quadratic mean and this differentiability in quadratic mean is characterized by the following 
theorem. 
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Theorem 2.2 A process X(t) is differentiable in quadratic mean on the interval [a, b] if and 
only if 
2 


0 
——Ryx(t, t 7.19 
min xx(t1, t2) (7.19) 


exists and is continuous on the diagonal a < t; = tp < b. 


Proof: The proof is similar to the proof of Theorem 2.1. 


7.2.3  Integrability 
Let X(t) be a random process, we subdivide the interval [a, b] into n sub-intervals of length 
b-a 


At = (7.20) 
n 


We say that X(t) is integrable in quadratic mean if there exists a random variable Y such that: 


n—l1 2 
lim E (SE xa +a)x At— r) l =0. (7.21) 
noo 
k=0 
And we note 
b 
i X(tdt =Y, (7.22) 


a 


the integral of X on the interval [a, b]. The integrability in quadratic mean is characterized by 
the following theorem. 


Theorem 2.3 A random process X(t) is integrable in quadratic mean on the interval [a, b] if 
and only if 


b b 
J [ Rect anan (7.23) 


converges. 


7.33 EXAMPLES OF RANDOM PROCESSES 
7.3.1 Gaussian Process 


A process X(t) is said to be Gaussian if its order n probability density function is Gaussian. 
The density function 


I x,.X5 pints: Xa (41, X2, <- -3 Xn b1, b2, .. -3 Én) (1.24) 
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is given by the formula 


(27)? AL? oo zt mya em). ae 
where 
X= (t1,-0-,%n)', m= (mx(t),..-, x(t)" (7.26) 
and 
A = (Ais); jain (7.27) 
with 
Aij = Cxx(ti, tj) = E [(Xi — mx(t)) (X; — mx(t)))]- (7.28) 


For a Gaussian process, if we know its mean trajectory my(t) and its autocorrelation function 
Rxx(t, t”), we can obtain its density of order n for any n. Indeed, we have: 


Aij = Rxx(ti, tj) — mx(G)mx(t;). (7.29) 


In summary, a Gaussian process is perfectly characterized by its mean and its autocorrelation 
function. 


7.3.2 Random Walk 


We subdivide the half-line [0, 00] into sub-intervals of equal duration At. Over the sub-interval 
[kAt, (k + 1)At], X(t) stays constant, that is 


X(t) = X(kAt), Vt €[kAt, (K+ 1)Atl and X(0)= 0. (7.30) 
Then 
X((k + DAt) = X(kKAt) + By, (7.31) 


where the B;’s are independent binary random variables with density function: 
1 1 
Prob(B,; = ô) = 5 and Prob(B, = —ô) = x (7.32) 


The so defined process X(t) is called a random walk. 
We can compute g(t) as follows: 


t 
Vt>0, g(t) =Int (=) ; (7.33) 


where Int(x) stands for the integer value of x. g(t) is the highest positive integer less or equal 
to t divided by At. 
Thus, 


g(t) 
X(t) = X(g(NAt) = X By, (7.34) 
k=1 
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therefore, 


a(t) 
my(t) = E[X(t)] =E È s f (1.35) 
k=1 


Since the B,’s are independent with zero mean, we have my y(t) = 0. The variance of X(t) 
becomes 


ox(t) = Var(X(t)) 


g(t) 
= Var (x a 
k=1 


g(t) 


= J Var(Bi). (7.36) 
k=1 
Since 
Var(By) = 8°, (7.37) 
then 
ox(t) = g(t)d°. (7.38) 


It is interesting to note that the variance of a random walk increases with t while the mean 
remains at 0. Physically, this means that the trajectory of the random process wanders around 
the horizontal axis. Moreover 


g) 


X) -Xt)= J Be, (7.39) 
k=g(t1)+1 
(tz) 

X) -X= D> Be. (7.40) 
k=g(t2)+1 


If ti < tb < t; then g(t) < g(t) < gts). Since B,’s are statistically independent, then 
X(t3) — X (tz) is independent of X(t2) — X (tı). 
Random processes exhibiting this property are qualified as independent increment processes. 
On the other hand, we observe that: 


8(tm) 


Xm) =X) Yo Be, (7.41) 
k=g(ta)+1 
where 
8(tm) = Int (+) and g(t,) = Int (+) 7 (7.42) 
It is clear that 
Prob(X (tm) < x|X (tn), X(tn-1), -< -, X(t1)) = Prob(X (tm) < x|X (tn), (7.43) 


forti <... < tn—1 < tn < tm. Processes with this property are called Markovian processes. 
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7.3.3 Wiener Process 


Let us examine the limit case of a random walk 


g(t) 
X(t) = 5 By, where g(t)= Int (=) : (7.44) 


k=1 


We have seen that mx(t) = 0 and o2 (£) = g(t)8?. Suppose that the jump 4 is proportional to the 
duration of the interval At, 6 = v At, then ož (t) = g(t)At. When At goes to 0, g(t)At tends 
to t. Thus, taking the limit, 


of(t) =t. (7.45) 


On the other hand, when At goes to 0, X(t) is the sum of a large number of independent 
binary random variables. By virtue of the central limit theorem, it converges to a Gaussian 
random variable. 

Taken to the limit, the random walk tends toward a Gaussian random process with zero 
mean, variance ox (t) = t and independent increments. Such a process is by definition a standard 
Wiener process. 


Definition 3.1 W(t) is a standard Wiener process if 


(i) WO) =0, 
(ii) W(t) has independent increments, which means that 


Vi t<tb<t<t4, Wt)—-W(t) and W(th)- W(t) 


are independent, 
(iii) Z(t) = W(t) — W(to) is Gaussian with zero mean and variance 


os =t—, tw St. 


From this definition we can deduce the following basic properties for a standard Wiener 
process. 


Property 3.2 Var(W(f)) = t and my(t) = E[W()] = 0. Since its variance increases with time 
t, the trajectory of this process moves away from the horizontal axis when t increases. 


Property 3.3 From the law of large numbers, we can prove that: 


WO o 


t t> ` 


(7.46) 
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Property 3.4 W’s trajectory, even if it is continuous, varies in a very irregular way that 
makes it non differentiable. Heuristically, this result comes from AW(t) = W(t + At) — W(t) 
being proportional to At. This property is often called Levy's oscillatory property. 


Property 3.5 The autocorrelation function of W(t) is given by: 


Rww(t, fo) = min(f;, t2). (7.47) 


Proof: Indeed, 
Rww(ti, t2) = E [W 6)W )]. (7.48) 
Without loss of generality, we suppose tı < fz then 
Rww(ti, b) = E [W(h)-— Wa) Wn) +W]. (7.49) 


Since W(t) has independent increments, 


E (W(t) — W(t) W (ti)] = E (Wa) E [W(e) — Wr) = 0. (7.50) 
Which implies that 
Rww(t, t2) = E [W0] = Var(W t)) = ti = minți, t). (7.51) 


Property 3.6 The standard Wiener process exhibits a very important property called the 
martingale property, that is: 


E [W (ta) |W (ta-1), W (tn-2), -.., WD] = Wt), t < ta <... <tn. (7.52) 


Proof: Since W(t) has independent increments, we have: 


E[(W(t,)|W(tn_-1), ---, Wt] (7.53) 
or 
E [W (ta) — W(tr—1) + Wtn-1) |W (tn-1), ---, Wt). (7.54) 
Moreover, the expression 
E [W (ta) — W(tr-1)|W(tr_1), ---, W t] (7.55) 


can be rewritten as 


E [W (tn) = W Cta- DIW (tn-0)] =E [W (t,)|W (tn-1)] 
=E [W ta- DIW tn-1) 1 - (7.56) 
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Since W (ta) — W(tn_1) and W(t,_1) are independent, 
E [W (|W (tr) — E [W tn- DIW (tr-1)] = E (Wt) — W Gta-1)] 


=0. (7.57) 
Given that 
E [W tn- W (tn—-1)] = W(tn-1), (7.58) 
we have 
E [W EW tn-1)] = W(tn-1)- (7.59) 


7.3.4 Brownian Bridge 


A very useful process obtained from the standard Wiener process that passes through pre- 
determined points at t = 0 and t = T is called a Brownian bridge and is denoted by Bo, MON 
All realizations of this bridge satisfy: 


Yx, y, BO) =x and By T)=y. (7.60) 


This bridge can be expressed as follows: 
) t 
Bor O =x + WO - 7 WT) -ytx): (7.61) 


This is equivalent to applying the Thales theorem (interpolation) in the random processes case. 
From this definition, we can deduce the mean of the Brownian bridge to be: 


t 
mpg(t)=x — F (x — y) (7.62) 
and its covariance 


Ce.a(ti. fo) = E | (B520) — malt) (B52 2) = ma(e)) | 


tit 
= min(t, h) — T (7.63) 


By means of MATLAB, many Brownian bridges can be generated quickly. Set x = 100, 
y = 105, T = 1. We generate 40 trajectories with 100 steps each. 


function rep=BBridge (NbTraj,NbPas,T,x,y) 
Function that generates many Brownian bridges 
SNbTraj: Number of bridges to generate 

$NbPas: Number of steps in each bridge 

$T: Period on which the bridge is generated 
$x: Starting point of the bridge 

Sy: Final point of the bridge 


DeltaT=T/NbPas; 


sGeneration of the increments and bridges 
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Figure 7.2 Brownian bridge examples 
dW= [zeros (NbTraj,1), sqrt (DeltaT)*randn(NbTraj, NbPas)]; 
W=cumsum (dW, 2) ; 
Inc= [zeros (NbTraj,1), 
repmat (DeltaT/T* (y-x-W(:, NbPas+1)),1,NbPas)]; 
rep=W+x+cumsum (Inc, 2) ; 
sGraphs of the bridges 
figure 
plot ([0:DeltaT:1] , rep) 
end 
An example of possible results obtained from this program is given in Figure 7.2. 
7.3.5 Fourier Transform of a Brownian Bridge 
We take the Brownian bridge 
t 
Bt) = W(t) - ro. (7.64) 


that is x = 0 and y = 0. On the interval [0, T], we can expand B(t) and obtain the Fourier 


series expansion, that is 


B(t) = ao + > (ap cos(kwot) + by sin(kwot)) , 
k=1 


with 


l T 
ao = z | Bid, 
0 


(7.65) 


(7.66) 
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T 
2 
ak = = f Bi costkwonar and (7.67) 
0 
A T 
bg = + feo sin(kwot)dt. (7.68) 
0 


It is important to note that B(t) is a Gaussian process with zero mean and auto-covariance 
function 


tit 
Cs,.g(t, t2) = min(ti, 1) = 7 (1.69) 


This allows us to conclude that a, and b are Gaussian random variables with mean of zero. 


7.3.6 Example of a Brownian Bridge 


In this section, we present a possible application of the Brownian bridges’ method. In finance, 
often the price of a product does not depend only on the final value of the asset. This price 
could be a function of the asset’s price at several dates prior to maturity of a derivative security. 
In this case, to increase the simulation’s speed, we can use Brownian bridges to simulate the 
asset’s price only at the required dates in question. 

For example, suppose that we have a product with price depending on the arithmetic mean 
of the Wiener process on days 75, 80 and 85. We suppose that the process has an initial value 
of 100 today and that its maturity is 90 days. The paid price will be the difference between the 
average price and the strike price of 100. Here are the steps that we have to execute: 


1. Generate a terminal value for W, which we denote by WT. 

2. Generate 20 trajectories W;(t) that will be used to compute the value of the asset for days 
75, 80, 85 and 90. 

3. From these trajectories W;(t), build 6;(t) that correspond to the values of the asset on days 
75, 80, 85 and 90 as follows: 


BAO =x + WO) - LWT) — y +x), (7.70) 


where x represents the initial asset value 100 and y the terminal value WT. 
. Compute the mean of 5;(f) for each trajectory. 
. Compute the final payoff. This payoff is thus associated with the terminal value WT. 
. Repeat steps 1 to 5, 10000 times. 
. Compute the mean of the 10 000 payoffs. 


ADM A 


We present below the MATLAB program that executes these steps. Note that we only generate 
values for the points we are interested in, i.e., the values of W(t) for t = 75, 80, 85 and 90. 


function rep=MeanArith3Days 

$Function that computes the price of an option written on 
the mean of 

the price of an asset. The mean is computed on days 
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%75, 


80 and 85 and we use Brownian bridges to decrease 


Sthe number of necessary simulations 


S$Initial parameters 
NbTraj=10000; 
NbPas=90; 


T=90/360; 
x=100; 
Strike=100; 
Traj=20; 


DeltaT=T/NbPas; 


%šGeneration of final values of the process 
WT=sqrt (T) *randn (NbTraj,1)+100; 


%Vector to save the arithmetic means 
A=zeros (NbTraj,1); 


for 


end 


i=1:NbTraj 


$Generation of the increments to build the bridge for WT (i) 
dW= [sqrt ( (75-0)*DeltaT)*randn (Traj,1), 

sqrt ( (80-75) *DeltaT) *randn(Traj,1),... 

sqrt ((85-80) *DeltaT) *randn(Traj,1), 

sqrt ( (90-85) *DeltaT) *randn(Traj,1)]; 
W=cumsum (dW, 2) ; 


$Building of the Brownian bridge for all the simulations 
Sbetween x and WT(i) 

B(:,1)=x+W(:,1)-(75*DeltaT) /T* (W(:,4) -WT (i) +x); 
B(:,2)=x+W(:,2) -(80*DeltaT) /T* (W(:,4) -WT (i) +x) ; 
B(:,3)=x+W(:,3) -(85*DeltaT) /T* (W(:,4) -WT(i 


> 
x 


$Compute the means for each of the simulations 
TheMean=mean (B, 2) ; 


sCompute the cash flows for each of the simulations 
Payoff=max(0,TheMean-Strike) ; 


$Compute the mean of the simulation for 
%£inal value WT(i) 
A(i)=mean(Payoff,1); 


SReturn the mean of each scenarios 


rep= 


end 


mean(A,1); 


Executing this program, we find that the price should be about 0.30$. We have been able to 
compute the price without having to simulate the intermediate trajectories between the dates 
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we were interested in. In many applications, this allows programmers to save computation 
time and memory. As a matter of fact, when dealing with bigger simulations, we will be 
able to compute the price of a derivative product faster because we only need to generate the 
trajectories on some specific dates. 


Notes and Complementary Readings 


For a complementary review of literature about concepts covered in this chapter, refer to the 
following books of different levels of sophistication: Baxter and Rennie (1996), Bjork (1999), 
Cvitanic and Zapatero (2004), Demange and Rochet (2005), Duffie (2001), Lamberton and 
Lapeyre (1992), Malliaris and Brock (1982), Merton (1992), Neftci (2000). 
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Solution of Stochastic Differential 


Equations 


As mentioned in the previous chapter, most phenomena in finance are random. For example, 
Figure 8.1 shows the daily prices of the American index S&P 500 for year 2004. 

These random phenomena can be analyzed with the statistical tools from stochastic calculus. 

From the heuristic construction of the Wiener process, we introduce the stochastic differen- 
tial equations. However, it is important to note that our objective is not to present an extensive 
treatment of stochastic calculus in this chapter. We rather provide intuitions linked to the 
Ito-Taylor’s lemma and its applications in numerical methods. 

We also present numerical discretization schemes that will be used in simulations. Practical 
application cases in the form of exercises are proposed and MATLAB programs are provided as 
solutions to these exercises. Note that, in all these programs, variance reduction and precision 
improvement techniques covered in Chapter 5 are used. Finally, we introduce processes with 
jumps. 

As we will see in the following chapter on contingent claims valuation, two equivalent 
approaches are used in the valuation of derivative securities in finance: the state variables 
approach which consists of solving partial differential equations and the probabilistic approach 
or equivalent martingale. The equivalence between the two approaches is established by the 
Feynman-Kac formula. Since our approach is based principally on numerical solutions of 
stochastic differential equations using Monte Carlo simulations, this chapter is one of the 
cornerstones of this book. 


1200 
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Value of S&P 500 


1100 


Feb04 Jun04 Sep04 Dec04 
Figure 8.1 Daily S&P 500 index prices for 2004 
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8.1 INTRODUCTION TO STOCHASTIC CALCULUS 
Let X be a random process. We consider the probability space 
{Q,F,P,t € [0, T]} 


generated by the process X, where Q is the sample space, F is the set of all possible events, P 
is the set of probability measures and [0, T] is the time axis. 


Definition 1.1 Filtration: 
The set of all events generated by the random variables 


{X(t), t € [0, t]} (8.1) 


denoted by F, is called a filtration. This filtration F, contains all the available information 
until time t. Thus, we have 


Fi, CF, CF Vt) < to. (8.2) 


The set {F;,, ti € [0, T ]} refers to the concept of sigma-field or sigma-algebra generated by 
the random process X. Generally, in finance, F, constitutes the information available up to 
time t. A rigorous treatment of sigma-algebra or sigma-field or measurable space will push us 
far into the probability measure theory, which is not the purpose of this chapter. (The interested 
reader can start with Williams (1991), for instance.) 


Definition 1.2 Adapted: 

The process X is said to be adapted to the filtration {F;,, ti € [0, T ]} if X is a measurable 
function under (F,,, Q) for all i. In other words, a stochastic process is an adapted process if 
it cannot be anticipated into the future. 


Definition 1.3 Martingale: 
A random process X is a martingale if: 


E[X@|F.,t <t] = X(). (8.3) 


In this definition, the conditional event is the information set available up to time t, F, , which 
contains all past realizations of X up to date t, so the conditional expectation of X with respect 
to F, is equal to X(t). This definition of the conditional expectation is the one used so far. 

We study the random process X on the interval [to, t]. We subdivide the time interval into 
sub-intervals At and look at the increment: 


X(t + At) — X(t) = AX(f). (8.4) 
We study the process from fo to ¢, that is X(t) with tọ < t < t. The conditional means 


E[AX(t)|X(), to < tT < t] (8.5) 
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correspond to what we call the increment prediction AX(f) using the knowledge of the process 
until time t. We denote this prediction by AY (t), then: 


E[AX(t)|X(t), to <T < t] = AY (0). (8.6) 


We can thus write AX(t) = AY(t) + AZ(t) where AZ is the prediction error. Therefore, 
physically, AZ is purely random. Summing the increment predictions from fo until f, we get: 


n-1 n—-1 
X(t) — X(t) = X AY (tu) + D> AZ), (8.7) 
k=0 k=0 
with to < ti <... < tk <.. < tn = t and AY (t) = E[AX(t)|X(t), to < T < t4]. 

Note that any process X(t) has two components: Y(t) and Z(t). Y(t) is the conditional 
expectation of X(t) with respect to the information up to time ¢ and Z(t) is the residual part of 
the process, i.e. X(t) — Y(t). The process Y exhibits less variation since it is the conditional 
expectation with respect to the observed trajectory of the process. The process Z is a purely 
random component that can vary widely since it is residual. 

To summarize, any random process is the sum of two processes: one from the prediction 
being less erratic than the other one which represents the prediction error. 

Let’s come back to equation AX(t) = AY (t) + AZ(t), where AY(f) is the prediction of the 
increment of X at instant t. We expect it to vary slowly as a function of time, that is: 


AY (t) = A(t, At. (8.8) 
When At goes to 0 then: 
ii t 
YA =X A(qAt =, | Acar. (8.9) 
to 
k=0 A 


On the other hand, E[AZ] = 0 and Z(to) = 0. Moreover, 
E [Z(t + At)|Z(t), to <T <t]= Z(t). (8.10) 
This means that Z(t) is a martingale. This implies 


E[AZ(t)|Z(), to < tT <t] =0. (8.11) 


Remark 1.4 It is interesting to note that martingales can be obtained from a transformation 
of the standard Wiener process. This transformation is characterized by the following theorem. 


Theorem 1.5 Lévy-Doob Theorem: 
Let W(t), O < t be a finite variance martingale such that: 
(a) W(0) =0, 
(b) E [dW(t)|W(t), 0 < T < t] = 0 where dW(t) = W(t + dt) — W(n), 
(c) E [AWP IW (T), 0 < t < t| = dt. 


Then W(t) is a standard Wiener process. 
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8.2 INTRODUCTION TO STOCHASTIC DIFFERENTIAL 
EQUATIONS 


8.2.1 Ito’s Integral 
We saw in a heuristic way that a random process can be described by the following equation: 
t t 
X(t) = X(0)+ / a(X(t), T)dt + f b(X(t),t)dW (T). (8.12) 
0 0 


In this model, the first integral can be interpreted as being the quadratic mean integral described 
in the previous chapter. On the other hand, the second integral is of a new type, involving the 
standard Wiener process. This type of integral was first introduced by Ito in 1949. 

On an infinitesimal interval, we can write it in a differential form: 


dX(t) =a (X(t), t)dt +b (X(t), t)dW (t), (8.13) 


where a(X(t), t) and b(X(t), t) are respectively the instantaneous mean and instantaneous 
standard deviation. These parameters generally depend on the past and on the process’s 
trajectory. 

Generally, 


$ t 
XO=X0+ | Adr + | B @awo, (8.14) 
0 0 
where A(t) and B(t) are functionals of X(t) for 0 < t < t. We are interested in the case: 
t t 
fenwa fe [IB (t) |?]dt < œ. (8.15) 
0 0 


Processes that are solutions to this equation are called Ito processes. We already studied the 
integral Í A (t)dt in the quadratic mean sense. For the second integral h B (t)dW (v), Ito 
provided mathematical construct and related calculus rules. Ito’s integral is defined as follows: 

A n—1 


fe (t)dW(t) = lim ` Bt) (Wisi) — W (t)], where tą = pa (8.16) 
n—> œ n 
0 k=0 


Remark 2.1 In the particular case where B(t) is a deterministic function, this integral is called 
a Wiener integral. 


8.2.2 Ito’s Lemma 


Let X be a uni-dimensional process defined as: 


dX(t)=a(X(t), that +b (X(t), dW (t). (8.17) 
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ag ag Wg 


Let Y (t) = g(t, X(t)) where g is twice continuously differentiable, that is and 


? s > Ox? D? ItO 

eg exist and are continuous. We suppose that a, b and g are memoryless transformations, i.e. 
they contain no information (memory) of earlier realizations of the random variable X. 

Since g is twice differentiable with respect to ¢ and X, we can use the Taylor series expansion 


to have: 


se X= s0 X0 seat ge AX) 


1 
Tii 2 apait D as -(Atj (AX j) 


+5 2 SAX oA (AX (8.18) 
Moreover 
(AX j) = (ajAtj +bjAW;’? = (At? + bI(AWY + 2a;bj ATAW. (8.19) 
Heuristically, when Af; goes to 0, then 
(Aty’, AtAW, AW At — 0, (AW(t))? = At — dt (8.20) 
and 
(AXP = b° At — bdt, (8.21) 


since (AW)? = O(AD) as At goes to zero, because AW = vy At where € ~ N(0, 1). Then 


ag ag 
At; 22 
Drs j > f Bar, (8.22) 
J 0 
a ra 
ye ax; > f 8 X(t), (8.23) 
7 Ox; Ox 
J 0 
32 A 32 
& 2 § 12 
2 yr AX) — |Z dt. (8.24) 
J J 0 
Which implies 
ra ra i fa 
g(t, XO) = 800, XO) + | sear + f SIX) + Al TE pdr. (8.25) 
OT ax 2J ax? 
0 0 0 
So 
dg 0g 17 5 
dY (t) = Sdt dX b?dt 
O= aran 


dg dg 1.4078 dg 
= dt — 2 
(2 +a + zb $ | dt +p aw, (8.26) 
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which constitutes Ito’s lemma. So Ito’s lemma is 


a 3 1 a? a 
dy- (E +a(X(t), n£ + -bX (t), jE dt + b(X(t), raw. (8.27) 
at əx 2 3x? ax 


Now, we examine some application examples of Ito’s lemma. 


Example 2.2 In this example, we want to compute the integral 
t 
[wouwe. (8.28) 
0 


To do the computation, we set X(t) = W(t) and choose 
Y(t) = g(t, X(t) 
= sw, (8.29) 
We apply Ito’s lemma and obtain 
dY (t) = W(t)dW(t) + Žar, (8.30) 


So we have 
f wowo = Y(t)—Y(0)— L 
0 


= 1 Wa) r (8.31) 
=; 2 l 


We clearly see that when working with stochastic integrals, we must be careful because the 
calculus rules are not the same as the “regular” differential calculus rules that we are used to. 

The results from this computation can be directly obtained from the Ito’s integral defi- 
nition. Indeed, the integral: 


n—1 


f W aW) = lim >We) WC) — WO) 
0 k=0 


n—1 


= Jim, 27 [Wt — Wto] 
1 2 
— 5 (Wes) — Wd 
1 1 n—1 
_ Ę W2 ain 2 
= iim, sv (tn) 5 2 [AW (t)] l ‘ (8.32) 


We show that 


n—-1 
2 — 
Jim, a [AW (tP — r| =0 (8.33) 
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in quadratic mean. That is, we prove 
n=l 2 
E ( [AWO — ) — 0 (8.34) 
pa At>0 
In fact, 
n-1 2 
E ( (LAW (t)? — a) , (8.35) 
k=0 
once developed gives 
n—-1 n-1 
E | XOX (AW?) — At) (AW?) — At) (8.36) 
k=0 j=0 
which can also be expanded to 
n=l n-i 5 
E | XO XO AW DAW? E) — At (AW?) + AW?) + (At) (8.37) 
k=0 j=0 


For this equation, as the number of steps n is inversely proportional to At, roughly, the 
expression is of order (A?)’, hence in the limit as Aż tends to zero, this expectation vanishes 


and yields 0. Thus we conclude that 


t 
f woawo = SW) = sf (8.38) 
0 
Example 2.3 In this example, we want to compute the integral 
t 
Í traW (r). (8.39) 
0 
We set Y (t) = tX (t) where X(t) = W (t). Ito’s lemma implies 
dY (t) = X(t)dt + td X(t) = d (tX (t)) (8.40) 
or 
t t 
tX(t)= [ xoa + I tdX (T). (8.41) 
0 0 
Thus 
t t 
f tdW(t) =tW(t) — l W(t)dt. (8.42) 
0 0 
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Generally, we have the following integration by part rule: 


t 


f soawo= sowo- f wedge. (8.43) 
0 


0 


8.2.3 Ito’s Lemma in the Multi-Dimensional Case 


Similarly to the one-dimensional case, the same rule applies to the multi-dimensional case. 
Let X € R” be a random vector with process 


dX(t) = A(X(t), t)dt + B(X(t), NdW(t), (8.44) 


where A(X(t), t) € R”, W(t) € R” and B(X(t), t) e R’*”. 
Set 


Y(t) = g(t, X(t) 
= (Y,(t),..., Ya)! , (8.45) 


with g : R x R” — IR‘, then the generalized Ito’s lemma is 


dY;,(t) = ea + 23 stax, O+ 2 


Jy D t dX; (t)dX ;(t). (8.46) 


Using Ito’s lemma, many stochastic differential equations can be analytically solved. How- 
ever, Ito’s lemma was not derived with the objective of finding analytical solutions to differ- 
ential equations. 


8.2.4 Solutions of Some Stochastic Differential Equations 


Although the objective of this chapter is not to provide an advanced treatise of stochastic 
calculus, we will study some stochastic differential equations commonly used in finance. 
Indeed, some relatively simple processes with closed form solutions are often employed in 
finance, such as the Ornstein-Uhlenbeck process and the geometric Brownian process. The 
Ornstein-Uhlenbeck process is used to describe mean reverting processes. An example of its 
use can be found in models of interest rate movements. The geometric Brownian motion is 
frequently used to model movements of an asset in the Black-Scholes-Merton model. We 
describe these processes in the following examples. 


Example 2.4 Ornstein-Uhlenbeck process 
Let X be a random process represented by 
dX(t 
Z0 = —a X (t) + b(t), (8.47) 
W(t) 


d 
where & follows a normal distribution. If we set E(t) = A with W the Wiener process, 


then 


dX(t) = —aX(t)dt + bdW (t). (8.48) 
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The computation of this differential equation implies 
t 
X(t) = XOe" + f e“"—) bdW (s). (8.49) 
0 


In finance, this mean-reverting process is often used to describe the dynamics of interest 
rates and stochastic volatilities of asset returns. 


Example 2.5 Log-normal process 
Consider the process 


dX (t) = aX (t)dt + bX (t)dW(t) (8.50) 
or 
X(t) = X(O)+ I aX(s)ds + / bX(s)dW(s). (8.51) 
0 0 
Set 
Y) = g(Z@) = e7” (8.52) 
with 
dZ(t) = adt + dW (t). (8.53) 


Ito’s lemma implies 
aro = Baz + 32 8azwy 
= Y(t)(adt + BdW(t)) + SY (pear 
= Y(t) (« $ 5#) dt + BY(t)dW(t). (8.54) 
If we choose a and £ such that: 
dZ(t) = (o - 5) dt + bdW (t), (8.55) 
then we note that X(t) = e@. Since Z can be written as 
Z(t) = Z(0) + (« - z) t +bW(t), (8.56) 


this brings us to the exact solution 
X(t) = Xe VWO with XO = X Ol = €2. (8.57) 


This process is regularly used to describe financial assets dynamics. 


In practice, it is often very hard, if not impossible, to obtain an explicit solution of a stochastic 
differential equation. In such situations, we need to find a numerical solution using Monte 
Carlo simulation techniques introduced in this book. 
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8.3 INTRODUCTION TO STOCHASTIC PROCESSES 
WITH JUMPS 


We saw that the classical model to represent the variation in financial asset values is of the 
form: 


dX(t) = w(t, X(t))dt + a(t, X())dW(t), (8.58) 


where X(t), u and o are respectively the value of process X at time ż, the drift and diffusion 
coefficient. W (t) is a standard Wiener process. 

In reality, sharp and large variations for X(t) do occur. These sharp and large variations are 
linked to market-microstructure mechanisms and information arrivals and most of the time 
represent sudden events. Merton (1976) proposed to account for these jumps in the model 
shown in (3.1). We have a diffusion process with jumps under the form 


dX(t) = wt, X(t))dt + o(t, X(t))dW(t) + J (t, X(t_))dN(), (8.59) 


where N (t) is a Poisson process. In the simplest case, N (t) is a Poisson process with parameter 
A, that is if N (t) represents the number of jumps on interval [0, t], it follows 


au At)" 
nt? 


Prob(N(t) =n) =e n=0, 1, 2,... (8.60) 


In this case, 
Prob(d N (t) = 1) = Adt (8.61) 


and J (t, X(t_)) corresponds to the jump’s amplitude. 
Equation (3.2) can be written: 


x) = Xco)+ | pis, Xoyas + | o(s, X(s))dW(s) 


to to 


+f J (s, X(s_))dN(s). (8.62) 


to 


Before the jump, that is at time f, equation (3.5) is of the form 


X(t) = X(to) + f u(s, X(s))ds + f o(s, X(s))dW(s), (8.63) 


to 


and the jump adds its effect to (3.6) to yield 
X(t) = X(t_)+ J (t, X(t_)) dN), (8.64) 


where dN (t) is the number of jumps that occurred between t_ and t_ + dt. 


Example 3.1 We consider the simple following case: 
dX(t) = uX(t)dt +oX(thdW(t) + X(t_)dJ (t), (8.65) 


where u and o are constant drift and diffusion coefficients and dJ represents the relative 
amplitude of the jump at time t. 

dX(t) has two components: one coming from the log-normal process and the other 
originating from jump dJ/(t). The term d/(t) represents the relative amplitude of the jump 
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at time t. We see that dJ(t) gives us information about the jump: the probability of the 
jump happening at time ¢ and the relative amplitude of the jump. We can represent it by 
YndN (t), where y, is the relative amplitude of the jump and 


Prob(dN(t,) = 1) =Adt,  Prob(dN(t,) = 0) = 1 —Adt. (8.66) 


Without the jump, we have 


X (th) = X (E7 Jet Parroaweo, (8.67) 


and if a jump occurs at time t„, equation (3.10) becomes 


o2 1 
XT) = XC eT NOT + yy). (8.68) 


We assume that y „ are independent, identically distributed random log-normal variables, 
i.e., Ga = lIn(1 + yn) is a Gaussian random variable. Given these characterizations, the 
solution of the simple model (3.8) is: 


a2 m 
X(t + h) = X (HeH THVI Gn, (8.69) 


where m is the total number of jumps happening between ¢ and t + h. G, are independent 
identically distributed Gaussian random variables and Z is a standard Gaussian random 
variable independent of Gq. 


8.4 NUMERICAL SOLUTIONS OF SOME STOCHASTIC 
DIFFERENTIAL EQUATIONS (SDE) 


Let a vector of stochastic differential equations be 
t t 
X(t) = X (to) + I a(X (s), s)ds + f b(X (s), s)dW (s), (8.70) 
to to 


with X € R¢, a € R?, W € R”, and b € R?*”., This vectorial equation can be decomposed 
into d differential equations as follows: 


t m t 
X(t) = Xx(to) + f a(X (s), s)ds + Df bi (X(s), s)XAW' (s), (8.71) 
to i=1 Y" 
fork = 1,..., d. 
So, let us consider the following representative one-dimensional differential equation 
dX(t) = a(X (t), t)dt + b(X (t), dW (t) (8.72) 
or equivalently 
t t 
X(t) = X (to) + i a(X (s), s)ds + f b(X (s), s)dW (s), (8.73) 
to to 


with a representing the drift or trend and b the diffusion coefficient or the volatility. To solve 
this equation numerically, we have to: (i) discretize the time and space, and (ii) use an iterative 
technique that will allow us to arrive at the results. 
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8.4.1 Ordinary Differential Equations 


To get a grasp on the solution of stochastic differential equations, we need to recall the schemes 
for numerical discretization of ordinary differential equations. This enables us to understand 
Euler and Milstein discretization schemes that will be covered later. We present some classical 
schemes that are most often used in ordinary differential equations discretization. 

Set the ordinary differential equation 


dx =a(x,t)dt, with initial condition x(to) = xo. (8.74) 
We define 
trot = th FAL X(tn41) = Xn + AX = Xn41 (8.75) 


and yj41 = Y(tn+1) the linear approximation of x,41, ie. 


d 
Vln + At) © y(n) + Ar. (8.76) 


Figure 8.2 provides an illustration of the approximation using the Euler method we discussed 
below. 
The local error is obtained by 


eln + 1) = [Xn41 — Yn (8.77) 
and the global error is 
€g(n + 1) = [Xn41 - Y(n+1,x0)|5 (8.78) 


where y(n41,x9) 18 the approximation value at ¢,,, with the starting value xp using equation 
(4.7). We say that the approximation is stable if, for At — 0, the error goes to zero. 

Many approximation techniques exist to numerically estimate this differential equation. We 
list some of these below. 


Figure 8.2 Approximation using the Euler method 
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Euler method 


We consider the approximation 


Yn+1 = Yn + aYn, tn )At, (8.79) 


that corresponds to the Ist order Taylor series expansion. If €, < K(At)”, we say that y is 
the speed or convergence degree of the algorithm. For deterministic equations, Euler method 
gives us unitary speed y = 1. 


Runge-Kutta method 
This method is based on the following approximation scheme 
1 
Ynt1 = In + — (ki + 2k; + 2k} + kf) At, (8.80) 

where 

ki= == ayn, tn), 

k? n + ta 1At, tn + EAP 

=a 
n 2 n 2 
2 n 

kt =a (Yn + kè At, tnt) - 


3 l2 1 
kè =a | yn + =k At, sin + At), 


This is the fourth-order Runge-Kutta algorithm that has an accumulated error proportional to 
(Any. 


Implicit method 
an tn) + a(Yn+1; tn+1) 
2 


1 
Yn+1 = Yn + 5 COn fn) + Ant Ings AL. (8.81) 


We take the slope 


, which brings us to the numerical approximation 


While Euler and Runge-Kutta are explicit approximation methods, this method is implicit 
because the solution is obtained iteratively. 


8.4.2 Stochastic Differential Equations 
Tto-Taylor expansion 


From equation 


t t 
X= Xy +f ads +f bdW,, (8.82) 
to to 


if we set f (X,) = Y;, by way of Ito’s lemma we obtain 


af 1 5 
Y, = X X 
dY, = zydi + dX, + 553 (dX,) 
of af 23° f af 
b dt +b—dW,. 8.83 
(Prats; Sep ge ce) 
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From this, 


t 2f 
FX) = Fy) + | (rats Sts 5) ds + [95 plaw, 
t \OS 0 2? 


Function f doesn’t explicitly depend on time then: 


Of lege ge af 
dY,=(a—+-=b dt + b——dW,. 
i (a Ogee ye eae 


We can rewrite it as 


dY, = L? fdt + L! fdW,, 


with 
af 1 soot 
L` f = a— + -b —, 
TE ae 2 3x? 
af 
Lif =b—= 
f Ox 


To ease the algebra, we will use the two operators L°? and L!. Thus 


f= 7+ f L f(X,)ds + f L! f(X,)dW,. 


to to 


(8.84) 


(8.85) 


(8.86) 


(8.87) 


(8.88) 


(8.89) 


If we apply equation (4.20) (i.e. apply operators L° and L!) to functions a and b, we get 


X,=X,, +f (axa) f Laxaz+ f L'acxaw.) ds 


to to 


$ f (baw + f ” K(X, )\dz + f “Loc dW.) dWs. 


This implies 
t S 
X, = Xu HAXE to) + HX NW, Wo) + | f L'BXDAW.aw, +0, 
to 
with O containing terms of higher order or equal to £. And since 
t Ss t Ss Z 
/ f L'b(X.)dW.dW, = if 1 (L'b(X,,) es f L°L'b(X,)dv 
to Jt to Jt to 
+ f LL", yaw, )aW-dW, 
to 
we finally have 
ðb t S , 
Xi = Xn FAX (t — to) + DX nW, — Wo) + D dW.dW, +O, 
to to 


with O’ containing terms of higher order or equal to 1°. 


(8.90) 


(8.91) 


(8.92) 


(8.93) 


Although the above equations involve tedious algebra, X can be approximated by Y as 


follows: 


ob fk+1 s 
Y1 = Yi + alan +DYDAW + 50%) f 1 dW,dwW,. 
tk tk 


(8.94) 
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The solution which satisfies this equation can be weak or strong depending on the type 
of convergence used. Note that the convergence in quadratic mean of sequence of random 
variables has been discussed in Chapter 3. The strong solution is preferred to study options 
which depend on the asset value paths (for example American and Exotic options). However, 
for European options, we only require that X converges weakly to Y since we only need the 
final expected values to be equal at maturity. 

Strong solutions take more computer time to obtain since they need the path of the solution 
to be close to the trajectory of the exact value. The weak solution needs only the moments at 
terminal points to be the same for any trajectory. In this case, we can use the Euler scheme 
which is an algorithm of degree 0.5. 


Euler scheme 


We retain only the terms of order 1 in the Taylor expansion. In this case, we have the Euler 
scheme 


Your = Yk +a(Y p At + DY, JAW. (8.95) 


As we said above, this algorithm is of degree 0.5. To obtain a higher degree approximation, 
we can use the Milstein scheme. 


Milstein scheme 


From the approximation of X, 
0b tet s 
Yar = Ye HAYDA + DYDAW: +DVD YO f f awaw,, (8.96) 
tk tk 


we compute the last term 


tk+1 s 
f ( / d w.) dW, 
tk tk 


tk+1 
i (Ws — Wd Ws 
tk 


1 2 2 
= 5 Wen wi (tht t)) 
—W, (Writ — We) 


1 1 
= z^ + 5 Wen Wi)(Wrai + We — 2W;) 
= LAW y Lay (8.97) 
=i 7 k 7 > å 


then 
1 ðb 2 
You, = Y +a (Yp) At + bY AW; + PYD YOACAW:) — At). (8.98) 


This approximation is called the Milstein scheme. This scheme gives us a precision of 
order 1. 

Note that these approximation schemes have been developed for one-dimensional cases. 
For higher than one dimension stochastic differential equations, the interested reader can read 
Kloeden and Platen (1992). However, using the simple Milstein scheme in (4.29) combined 
with the Cholesky decomposition in the multi-dimensional case can lead to satisfactory results 
relative to Euler ones. 
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8.5 APPLICATION CASE: GENERATION OF A 
STOCHASTIC DIFFERENTIAL EQUATION USING 
THE EULER AND MILSTEIN SCHEMES 


In this section, we present an application case using the different discretization schemes that 
we previously presented. The aim of this exercise is to simulate a stochastic process using the 
discretization schemes of Euler and Milstein. 

Consider the following stochastic differential equation: 


dX(t)=aX(t)dt + bX(t)dW(t), (8.99) 


with X(0) = 100, a = 0.05 and b = 0.20. This equation can be solved as 
2 


log(X(t)) = log(X(0)) + (o — =) t+ bW(t), (8.100) 
which yields 
XH) = Xe 7 DWO, (8.101) 


Based on this expression, in MATLAB, the simulation of trajectories can be performed in the 
following way. 


Steps to simulate trajectories. 
SNbTraj: Number of trajectories. 


$Computation of gaussian variables 
WT = sqrt(T)*randn(NbTraj, 1); 


sComputation of trajectories 
Traj = X0*exp((a-b*b/2) *T+b*WT) ; 
Using a discretization step of A, the Euler scheme is described by 
Xn+1 = Xn +aX,At + bX, AW), (8.102) 
and the Milstein scheme is given by 
1 
Xni = Xn +aX,At + bX, AW, + 5P Xn(AWay — At), (8.103) 


where AW, is a Gaussian random variable with zero mean and variance of At. 
For Euler’s approximations, the trajectories can be simulated with MATLAB in the following 
way: 


Steps to compute Euler approximations 
SNbTraj: Number of trajectories 
SNbStep: Number of step to simulate 


sComputation of Gaussian variables 
DeltaW = sqrt (DeltaT) *randn(NbTraj,NbStep) ; 


Matrices used to store the approximations 
Traj=zeros (NbTraj,NbStep+1) ; 
Traj(:,1) = X0*ones (NbTraj,1); 
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SLoop that simulates all the time steps 
for i=1:NbStep 

Traj(:,i+1) = Traj(:,i).* (l+a*DeltaT+b*DeltaW(:,1i)); 
end 


For Milstein’s scheme, we need to add the term $b°((AWn)? — Af) just at time i + 1. 


%šSteps to compute Milstein approximations 
SNbTraj: Number of trajectories 
SNbStep: Number of steps to simulate 


SComputations of the Gaussian variables 
DeltaW = sqrt (DeltaT) *randn(NbTraj,NbStep) ; 


S$Addition for Milstein approximation 
DeltaW2 = DeltaW.*DeltaWw; 


SMatrices used to store the approximations 
Traj=zeros (NbTraj ,NbStep+1) ; 
Traj(:,1) = X0O*ones(NbTraj,1)j; 


SLoop that simulates all the time steps 
for i=1:NbStep 
Traj(:,i+1) = Traj(:,i).*(1l+a*DeltaT+b*DeltaW(:,i) + 
b*2/2* (DeltaW2(:,i) - 
DeltaT*ones (NbTraj,1))); 
end 


When we perform Monte Carlo simulations, it is important to provide a measure for the 
quality of the obtained results. Sensitivity studies allow us to observe the variation in the 
quality of the simulations when we change the number of simulations, the confidence interval 
or the discretization step. 


8.5.1 Sensitivity with Respect to the Number of Simulated Series 


For each of the schemes, simulate 10, 100, 1000, 10000 and 100000 series (NbSeries) of 
NbTraj = 1000 simulations of this process on 365 days and considering a weekly discretization 
step At = 1/52. Then compute the absolute approximation error € with a confidence interval 
of a = 95% for each scheme. 

We use Euler and Milstein methods already coded with the parameters given above. Below 
is a program for the Euler method case. We employ a method using the logarithm of X to 
optimize the program with matrix functions in MATLAB. In fact, we simulate the value of the 
logarithm of X„+1. For Euler’s approximation, we have 


log(X,41) = log(X,(1 + aAt + bAW,)) 
= log(X,,) + log + aAt + bAW,,) 


n 


= log(Xo) + J log + aAr + bAW;). (8.104) 
i=0 
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To get the exact solutions, we can also use a similar method. Computing the value of the 
logarithm of X(t), we have 


log(X(t)) = log(X (Qe@— 2?" +9W 0) 


1 
= log(X(0)) + (o — 3) t+bW(t). (8.105) 
The exact solutions at point t = (n + 1)At is given by 


log(X ((n + 1)At)) = log(X (0)) + (o — z) (n+ DAt + bW((n + DAt) 
= log(X (0)) 


` 12 ; : 
+). ((« z. ) At+bW(i + DAN- WG an) (8.106) 
i=0 


function rep=Euler (NbSeries,NbTraj,NbStep,alpha) 

$Function that computes the approximations error when we use 
Sthe Euler discretization and presents the confidence interval 
sNbSeries: Number of series to simulate 

SNbTraj: Number of trajectories by series 

sNbStep: Number of steps to simulate 

Salpha: Confidence level for the error 


SInitial parameters 
X0=100; 

a=0.05; 

b=0.20; 

Taly 

DeltaT = 1/NbStep; 
SqDeltaT = sqrt (DeltaT); 


sVector of average epsilons for each series 
EpsilonSeries = zeros (NbSeries,1); 


$For each series we simulate the analytical solutions and 
Sthe approximations 
for i=1:NbSeries 


%Simulation of the Gaussian variables 
DeltaW = SqDeltaT*randn (NbTraj,NbStep) ; 


sConstruction of exact solutions 

Increments = (a-b*b/2)*DeltaT + b*Deltaw; 
LogPaths=cumsum( [log (X0) *ones (NbTraj,1),Increments] ,2) ; 
SPaths = exp(LogPaths) ; 


sConstruction of Euler approximations 
EulerInc = log([X0*ones (NbTraj,1),ones(NbTraj,NbStep) +... 
a*DeltaT*ones (NbTraj,NbStep) +b*DeltawW] ) ; 
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Table 8.1 Sensitivity of the results to the number of series 


Euler Milstein 
NbSeries € Interval € Interval 
10 0.3287 (0.3257, 0.3316] 0.0170 [0.0167, 0.0173] 
100 0.3277 (0.3265, 0.3290] 0.0169 [0.0168, 0.0170] 
1000 0.3281 (0.3277, 0.3286] 0.0170 [0.0170, 0.0170] 
10000 0.3278 (0.3277, 0.3280] 0.0170 [0.0170, 0.0170] 
100000 0.3279 [0.3279, 0.3280] 0.0170 [0.0170, 0.0170] 


EulerIncCum = cumsum(EulerInc, 2); 
Euler = exp(EulerIncCum) ; 


sTemporary variable to store the approximation errors 
temp=0; 
for j=1:NbTraj 

temp=tempt+abs (Euler(j,NbStep+1) - SPaths(j,NbStep+1) ) ; 
end; 


sComputation of the average of the errors for series 
EpsilonSeries (i,1)=temp/NbTraj; 
end; 


sComputation of the average and variance of errors for all series 
Epsilon = mean(EpsilonSeries,1); 
Var = var(EpsilonSeries) ; 


sComputation of the confidence interval 
DeltaEpsilon = tinv(alpha,NbSeries-1)*sqrt ((Var/NbSeries) ) ; 


sConfidence interval at level alpha% 
rep=[Epsilon-DeltaEpsilon, Epsilon, Epsilon+DeltaEpsilon] ; 


We obtain the results presented in Table 8.1. Note that the confidence interval decreases 
with the number of simulated series. 


8.5.2 Sensitivity with Respect to the Confidence Interval 


We repeat the preceding simulations with NbSeries = 1000, but this time we compute the 
approximation error as a function of a = 90%, 95%, 99%. 

The results are presented in Table 8.2. We see that, based on the desired level of confidence, 
we come up with slightly different intervals. In other cases, the effect of the parameter œ can 
be much more important. 


8.5.3 Sensitivity with Respect to the Number of Simulations 


We repeat the simulations with NbSeries = 1000 and a = 95%, but this time we compute 
the approximation error as a function of the number of simulations NbTraj = 10, 100, 1000, 
10000. The results are presented in Table 8.3. 
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Table 8.2 Sensitivity of the results to the confidence interval 


Euler Milstein 
a € Interval € Interval 
0.90 0.3283 [0.3279, 0.3286] 0.01697 [0.01695, 0.01699] 
0.95 0.3274 [0.3269, 0.3278] 0.01699 [0.01696, 0.01701] 
0.99 0.3286 [0.3279, 0.3292] 0.01696 [0.01692, 0.01699] 


Table 8.3 Sensitivity of the results to the number of simulations 


Euler Milstein 
NbTraj € Interval € Interval 
10 0.3305 [0.32615, 0.32805] 0.01688 [0.016632, 0.017135] 
100 0.3267 [0.32538, 0.32805] 0.01703 [0.016944, 0.017109] 
1000 0.3279 [0.32745, 0.32832] 0.01699 [0.016968, 0.017017] 
10000 0.3279 [0.32778, 0.32804] 0.01698 [0.016968, 0.016983] 


Table 8.4 Sensitivity of the results to the discretization step 


Euler Milstein 
At € Interval € Interval 
1/12 0.67480 [0.67386, 0.67574] 0.07201 [0.07190, 0.07212] 
1/52 0.32784 [0.32740, 0.32827] 0.01698 (0.01695, 0.01700] 
1/365 0.12427 (0.12411, 0.12444] 0.002431 [0.002427, 0.002434] 


8.5.4 Sensitivity with Respect to the Time Step 


Using parameters NbSeries = 1000, NbTraj = 1000, œ = 95%, compute the approximation 
error as a function of At = 1/365 (days), 1/52 (weeks) and 1/12 (months). 

The results are presented in Table 8.4. The discretization step can significantly affect the 
results precision. 


8.6 APPLICATION CASE: SIMULATION OF A 
STOCHASTIC DIFFERENTIAL EQUATION WITH CONTROL 
AND ANTITHETIC VARIABLES 


In this section, we present an application case to illustrate the use of simulation quality 
improvement concepts described in previous chapters. 
We take the same stochastic differential equation: 


dX(t) = aX (tdt + bX (t)dW(t) (8.107) 


with X(0) = 100, a = 0.05 and b = 0.20. 
In the following section we use Euler’s scheme with discretization step At = 1/100 on one 
period T = 1. 
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Table 8.5 Simulations without variance reduction technique 


Mean Variance 
NbTraj Estimate Interval Estimate Interval 
10 105.18 [104.82, 105.54] 406.88 (395.76, 418.00] 
100 105.25 [105.13, 105.37] 445.84 [441.95, 449.73] 
1000 105.12 [105.08, 105.15] 451.34 [450.05, 452.63] 


10000 105.13 [105.12, 105.14] 450.67 (450.29, 451.06] 


8.6.1 Simple Simulations 


Perform NbSeries = 1000 series with NbTraj = 10, 100, 1000, 10000 simulations to come up 
with the mean and the variance of the process X at T = 1, denoted by E[X(1)] and Var(X(1)). 
Then compute the confidence interval. 

Table 8.5 presents the simulations results. 


8.6.2 Simulations with Control Variables 


We use the same program but introduce here a control variable. We proceed in the following 
way. For each generated value of X(1), we use the control variate 


100 
> AW, (8.108) 
n=0 


to readjust the value of X(1). This can be done quickly in MATLAB by: 


function EulerCont (NbSeries, NbTraj, NbStep, alpha) 
sFunction that computes the mean and variance of variable X 
Susing the control variables method 

SNbSeries: Number of series to simulate 

SNbTraj: Number of trajectories for each series 

SNbStep: Number of steps to generate 

Salpha: Level of confidence for the error 


SInitial parameters 
X0=100; 

a=0.05; 

b=0.20; 

DeltaT = 1/NbStep; 
T=1.% 


SqDeltaT = sqrt (DeltaT) ; 
SAverages and variances vector 
AverageSeries = zeros(NbSeries,1) ; 


VarSeries=zeros (NbSeries,1)j; 


for i=1:NbSeries 
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sSimulation of Gaussian variables 
DeltaW = SqDeltaT*randn (NbTraj,NbStep) ; 


$Euler approximations with logarithms method 

EulerInc = log([X0*ones(NbTraj,1), ones(NbTraj,NbStep) + 
a*DeltaT*ones (NbTraj,NbStep) +b*DeltaW] ) ; 

EulerIncCum = cumsum(EulerInc, 2); 

Euler = exp (EulerIncCum) ; 


SControl variable 

A=Euler(:,NbStep+1) ; 

B=sum(DeltaW, 2) ; 
MatrixCovariance=cov (A,B); 
correction=-MatrixCovariance (1,2) /var(B) ; 
A=A+correction* (B-mean(B) ); 


AverageSeries(i,1)=mean(A,1); 
VarSeries(i,1)=var(A,1); 
end 


$Computation of the average of averages and variances of the series 
Average = mean (AverageSeries,1)j; 
Variance=mean(VarSeries,1); 


sComputation of the variance of the averages and variances of series 
VarAverage = var (AverageSeries,1); 
VarVariance=var (VarSeries,1)j; 


$Computation of the confidence interval for the mean and variance 
DeltaAverage=tinv (alpha, NbSeries-1) *sqrt (VarAverage/NbSeries) ; 
DeltaVariance=tinv(alpha,NbSeries-1)*sqrt (VarVariance/NbSeries) ; 


IntervalAverage = [Average-DeltaAverage, Average, Average+ 
DeltaAverage] 
IntervalVariance = [Variance-DeltaVariance, Variance, 


Variance+DeltaVariance] 
end 


Simulation results are shown in Table 8.6. After running the preceding program 1000 times 
we obtain the confidence interval for the mean and the variance. 


Table 8.6 Simulations with control variables 


Mean Variance 
NbTraj Estimate Interval Estimate Interval 
10 105.30 (104.82, 105.52] 35.92 [5.12, 5.92] 
100 105.17 [105.06, 105.27] 8.33 [8.13, 8.53] 
1000 105.12 [105.09, 105.16] 8.83 [8.76, 8.90] 


10000 105.12 [105.11, 105.13] 8.86 [8.84, 8.89] 
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8.6.3 Simulations with Antithetic Variables 


We use antithetic variables to increase the precision of each series. We proceed as follows to 
generate the variables AW. 


function EulerAnti (NbSeries,NbTraj,NbStep, alpha) 

sFunction that computes the mean and variance of variable X 
S$using the antithetic variables technique 

SNbSeries: Number of series to generate 

$NbTraj: Number of trajectories for each series 

SNbStep: Number of steps to generate 

Salpha: Confidence level for the error 


S$Initial parameters 
X0=100; 

a=0.05; 

b=0.20; 

DeltaT = 1/NbStep; 

T=1; 

SqDeltaT = sqrt (DeltaT) ; 


sAverages and variances vector 
AverageSeries = zeros(NbSeries,1) ; 
VarSeries=zeros (NbSeries,1)j; 


for i=1:NbSeries 


%Generation of random variables and antithetic variables 
DeltaW = SqDeltaT*randn (NbTraj,NbStep) ; 
DeltaW=[DeltaW; -Deltaw] ; 


$Euler approximations with logarithms method 

EulerInc = log([X0*ones(2*NbTraj,1), ones (2*NbTraj, 
NbStep) + 

a*DeltaT*ones (2*NbTraj,NbStep)+b*Deltaw] ) ; 

EulerIncCum = cumsum(Eulertnc, 2) ; 

Euler = exp(EulerIncCum) ; 


AverageSeries (i,1)=mean(Euler(:,NbStep+1),1); 
VarSeries(i,1)=var(Euler(:,NbStep+1),1); 


end 


$Computation of the average of averages and variances of series 
Average = mean(AverageSeries,1)j; 
Variance=mean (VarSeries,1); 


sComputation of the variance of averages and variances of series 
VarAverage = var (AverageSeries,1); 
VarVariance=var (VarSeries,1)j; 
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Table 8.7 Simulations with antithetic variables 


Mean ($) Variance ($7) 
NbTraj Estimate Interval Estimate Interval 
10 105.09 (105.05, 105.14] 441.40 [430.96, 451.84] 
100 105.11 (105.10, 105.13] 447.11 [443.63, 450.58] 
1000 105.13 (105.12, 105.13] 450.87 [449.77, 451.97] 
10000 105.12 (105.12, 105.13] 450.22 [449.87, 450.57] 


sComputation of the confidence interval for the mean and the 
variance 
DeltaAverage=tinv (alpha, NbSeries-1) *sqrt (VarAverage/NbSeries) ; 
DeltaVariance=tinv(alpha,NbSeries-1) *sqrt (VarVariance/NbSeries) ; 


IntervalAverage = [Average-DeltaAverage, Average, Average+ 
DeltaAverage] 
IntervalVariance = [Variance-DeltaVariance, Variance, 


Variance+DeltaVariance] 


end 
Simulation results are presented in Table 8.7. 

The reader is invited to study simulation cases combining several different techniques of 
variance reduction and precision enhancement as practice exercises. 


8.7 APPLICATION CASE: GENERATION OF A STOCHASTIC 
DIFFERENTIAL EQUATION WITH JUMPS 


For this application case, we consider an alternative version of equation (3.12) from Merton’s 
(1976) work: 


S(t 4+ h) = Set FM OSa ht oVhZ pm(uyz—0.50f)+0y ae G; ; (8.109) 


where uy and o; are the average and standard deviation of the jump’s magnitude, and Z and 
G; are Gaussian random variables. u represents the instantaneous expected rate of return of 
the asset, ô is the dividend rate, ø is the instantaneous volatility of the asset’s return, A is the 
yearly average of the number of jumps, m is a Poisson random variable with mean à and A is 
the time step. The case without jump can be obtained with m = 0, wy =o =A =O. 

This stochastic differential equation with jumps represents a financial asset’s price process. 
Merton (1976) assumes the jumps to be independent and idiosyncratic, meaning that jumps 
are diversifiable. Diversification means that by holding a portfolio of several financial assets, 
one is more likely to cancel the effect of jumps all together. 

Parameter k is given by: 


kK=e"—1, (8.110) 


with uz > 0. k representing the negative value of the expected percentage jump. 

Using the following values for the parameters, S(0) = $ 100, wy = 0.02, oz = 0.10, u = 
0.05, 6 = 0, ø = 0.30, A = 3 and T = 1 year, perform the simulations of the processes with 
and without jumps and plot their graphical representations. 
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In MATLAB, a program simulating these trajectories with jumps might look like the 
following one (where we directly generate log(S(t))). We use 250 steps per year and the 
variable NbDays represents the number of days for the simulation. In our case, we also 
have 250 because the simulation covers a whole year. A MATLAB program is written as 
follows: 


function Jump 
$Function that simulates a stochastic process with jumps 


SInitial parameters 
X0=100; 

mu=0.05; 

Div=0.00; 

Lambda= 3; 
sigma=0.3; 
muJ=0.02; 
sigmaJ=0.10; 

TS 

NbStep=250; sNumber of steps per year 
DeltaT=1/NbStep; 
NbDays=T*NbStep; 
k=exp (-muJ) -1; 


%Simulation of Poisson and Gaussian random variables 
PoissonG=poissrnd (Lambda/NbPas,1,NbDays) ; 
DeltaZ=randn(1,NbDays) ; 

PoissonJumps=zeros (1,NbDays) ; 


sDetermination of jumps for the Poisson process 
for i=1:NbDays 

PoissonJumpss (1,i)=sum(randn(PoissonG(1,i),1)); 
end 


$Constructing the increments 

Increment sJumps= (mu-Div-Lambda*k-0.5*sigma*2) *DeltaT*... 
ones (1,NbDays) +sigma*sqrt (DeltaT) *DeltaZ+... 
(muJ-0.5*sigmaJ~2) *PoissonG+sigmaJ*... 
PoissonJumps ; 

Increment s=(mu-Div-0.5*sigma”2) *DeltaT*ones(1,NbDays)+... 
sigma* sqrt (DeltaT) *DeltaZ; 

SolJumps=exp (cumsum( [log (X0) ,IncrementsJumps] ,2)); 

Sol=exp (cumsum([log(X0) ,Increments] ,2)); 


sGraph of the trajectories with and without jumps 
figure 

hold on 

plot (0:1:NbDays,SolJumps,0:1:NbDays,Sol) ; 
legend(’With jumps’ ,’Without jump’); 

hold off 


end 
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Figure 8.3 Simulations with and without jumps 


Thus, we can draw a graph showing two trajectories resulting from the same variables Z, 
one with jumps and the other without jump. Figure 8.3 illustrates this process with and without 
jumps. 


Notes and Complementary Readings 


For a complementary review of literature on stochastic calculus, the reader can consult the 
following references: Bjork (1999), Choe (1983), Demange and Rochet (2005), Lamberton 
and Lapeyre (1992), Merton (1992), Neftci (2000), Kloeden and Platen (1992), Kloeden, 
Platen and Schurz (1997) and Oksendal (2003). 

For a complementary reference on numerical schemes using MATLAB, one can read Borse 
(1997). 
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General Approach to the Valuation 


of Contingent Claims 


This chapter presents the basis for valuing contingent claims such as financial options. We 
begin with a brief definition of simple options, the most widely known contingent claims. 


Definition An option is a financial security or a real asset that gives the right to its 
owner/holder to buy or to sell another asset (called the underlying asset) at a specific date 
(maturity date or exercise date) or for a given period, and at a predetermined price (called 
strike price or exercise price). 


Many types of options, explicit or implicit, exist. A call option allows its owner to buy the 
underlying asset at the strike price before or at maturity date and a put option allows its owner 
to sell the underlying asset at the exercise price before or at the maturity date. Figures 9.1 and 
9.2 depict the payoffs of an option at maturity date as a function of the price of the underlying 
asset. 

The option is called European if it can be exercised only at the maturity date. An option is 
said to be American if it can be exercised at any time before or at the maturity date. 

In addition to these relatively simple options, there are more complex options which depend 
on the complexity of the choice of the underlying assets and the payoff structure. One popular 
complex option are the so-called Asian options whose payoffs depend on all the underlying 
asset price trajectories. Options whose payoffs depend on the trajectories of their underlying 
assets prices are called path-dependent options. 


Call 


Strike price Sos 


K 


we 


Asset price 


Figure 9.1 Call option 
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Figure 9.2 Put option 


We begin with the simple binomial pricing model to present the intuition of a hedge portfolio 
and the “risk-neutralized” principle (term used in Ingersoll (1987)) or risk-neutral pricing. Then 
we continue with the Black-Scholes-Merton partial differential equation. We also introduce 
the sensitivity coefficients called Greeks. Afterwards, we present the probabilistic approach of 
contingent claims valuation or the equivalent martingale approach using stochastic differential 
equations. The latter approach will form the backbone of the next chapter, which is on the 
pricing of complex options with no known analytical closed form solutions. 


9.1 THE COX, ROSS AND RUBINSTEIN (1979) BINOMIAL 
MODEL OF OPTION PRICING 


To grasp the intuition in the famous Black and Scholes formula to value European options, 
we first present the binomial model for option valuation that was developed by Cox, Ross and 
Rubinstein in 1979. This model will allow the reader to understand the risk-neutralized or risk 
neutral concept. 

Consider a government zero-coupon bond that yields a risk-free rate r; this represents a 
risk-free investment for its owner since it generates a sure return (same payments regardless 
of the state of the economy). We also assume the existence of a risky asset that follows a 
multiplicative binomial process in discrete time. This entails only two states to complete the 
economy (good or bad), with the asset price increasing if the good state occurs and decreasing 
if the bad state happens. The other assumptions of the model are: 


9.1.1 Assumptions 


1. In the good state, the asset price increases at a rate u with probability p, and in the bad 
state, it decreases at a rate d with probability 1 — p,. 

2. The asset price today is S. 

3. The asset doesn’t pay dividend. 

4. The risk-free rate r is positive and constant with d < r < u. 
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Schematically, the asset price’s process over one period can be represented as follows: 
Sy, =(1+u)S prob. p, 
a 
S 9.1 
3 (9.1) 
Sa =(1+d)S_ prob. 1-— p,. 


9.1.2 Price of a Call Option 


We consider a call option on asset S (underlying asset) described above. The case of a put 
option can be developed in a similar way. The option’s strike price is K and the option’s price 
is Call. This means that at the end of the period the payoffs of the option are 


C, = max(0, S, — K) = max(0, (1+ u)S — K) (9.2) 
in the good state and 
Ca = max(0, Sg — K) = max(0, (1+ d)S — K) (9.3) 


in the bad state. Schematically, we can represent the evolution of the option’s value the 
following way: 


C, = max(0, (1 +u)S — K) prob. p, 


A 
9.4 
Call N (9.4) 
Ca = max(0,(1+d)S — K) prob. 1 — p,. 
We will show that it is possible to reproduce future payments of the option with the right 
combination of risk-free bonds and the underlying asset. In order to do that, we consider 


the following portfolio consisting of A asset and an amount B of debt (risk-free bond). The 
portfolio’s process is the following: 


A(1+u)S+(1+r)B  prob.p, 


ra 
AS+B\. (9.5) 


Ad+da)S+(1+7)B  prob.1 — p,. 
In order to reproduce the call, we require that 


Ad+wS+0+r)B=Cy, 


AU + d)S +(1+r)B = Cu. See 
Which is equivalent to solving the following system of equations: 
(+u)S 1+4+r A) (Cy 
(eee oa) ee) 
The solution is 
Cym 1 Ca — (1 + d)Cu 
= 4 and Sy Sas Bae ae, (9.8) 


A = ——— 
(u —d)S (u =d) +r) 


To rule out arbitrage possibilities, the portfolio’s value must be the same as the call’s value 
(law of one price). No arbitrage means no free lunch, i.e., if two assets offer the same future 
payments, they must have the same price today, otherwise one can buy the cheaper one and 
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sell the expensive one. In other words, we rule out the possibility of being able to have positive 
gains out of zero investment. Thus 


Cy — Ca A +u)Ca — (1 + d)C, 


Call = AS + B = S+ : 9.9 
R (u— a)s ddr) aie 
which implies 
r—d u—r 
Cu + =C 
Call = uzdu V ucdd (9.10) 
l+r 


This expression can be written as 


PnCu + ad 7 Pn)Ca r-d 


Call = à where Pn 
l+r u—d 


: (9.11) 


Pn and 1 — p, are the risk-neutral probabilities. Note that these probabilities don’t depend on 
the initial physical and real probabilities p, and 1 — p,. In fact, these risk-neutral probabilities 
are obtained as a transformation of the real probability space into a new probability space, 
called the risk-neutral measure. We will come back later to this point. 


Example 1.1 Consider the case where 1 + u = 1.2, 1 + d = 0.80, 1 + r = 1.05, 
S = $ 45 and K = $ 40. With these values, we compute 


C, = max(0, 1.2 x 45 — 40) = 54 — 40 = $ 14, 


Ca = max(0, 0.8 x 45 — 40) = $ 0, 


and 
1.05 — 0.80 
n = => = 0.625, 1 — p, = 1 — 0.625 = 0.375. 
12—0.80 á 
Which implies 
KOM 1— p,)C 0.625 x 14 + 0.375 x 0 
O E FC = Pada _ ee ” | Z $ 8.33. 


l+r 1.05 


Remark 1.2 Probability p, doesn’t appear in these formulas. Thus, even if investors have 
different subjective probabilities for the up and down movements, they do agree on the relation 
between Call, S and r. So the option’s value doesn’t depend on the investors’ attitude toward 
risk. 


Remark 1.3 The value of the option depends only on the underlying asset’s value, and doesn’t 
explicitly depend on other factors like the market portfolio, for example. 
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Remark 1.4 The quantity 
r~d 
u—d 
can be considered as a probability. It is the value that p, must have at equilibrium as if investors 
are risk neutral, no where, we make the assumption that agents are risk neutral. The risk-neutral 
pricing implies a probability measure transformation using the concept of market price of risk 
discussed in the next section. Indeed, the market price of risk allows us to transform the return 
in the real world into a risk-free return under the risk-neutral probability. In other words, the 
expected return in a risk-neutralized economy is r. To see this, let us perform the following 
computation 

— prA+wS + — p+ a)s 


l+r= 5 =p,i+u)+( — p.) +d). (9.13) 


Pn = (9.12) 


Which implies 

r—d 
u—d 
from which we deduce the risk-neutralized principle also called risk-neutral. 


Pr = = Pn, (9.14) 


9.1.3 Extension To N Periods 


For the case of a binomial tree with two periods, the computation of the call price is carried 
out as follows. We first specify the process of the underlying asset’s price: 


Suu = ad F u} S 
S, = (1+ u)S 
S \ j Sua = (1 + u)(1 + d)S (9.15) 
Sa = (1 + d)S 
X 2 
Saa = (1+d)S 
The call option price process will be 


Cuu = max(0, (1 + u)?S — K) 
N 
Call Cua = max (0, (1 +u) + d)S — K) (9.16) 


Caa = max(0, (1 + d)*S — K) 
From the preceding analysis we have 


= PnCuu F (d pä Pn)Cua 
E l+r 


Cu ; (9.17) 


= PnCud + ad ai Pn)Caa 
1+r ; 


Ca (9.18) 
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Substituting these two expressions in 


PnCu + ad = Pn)Ca 


Call = ; (9.19) 
l+r 
we get 
(Pn) Cun + 2pPn(l g Pn)Cua + ad i Pn) Caa 
Call = 
(l+r) 
(Pn) max(0, (1 + u)’S — K) 
+2pn(1 — pn) max(0, (1 + u)U + d)S — K) 
+(1 — pn}? max(0, (1 +d}?S — K) 
d+r}? 
2 i l i , 
Ejo (PaA = pr? max(O, A + wy + APIS — K) 
E d+ry 
(9.20) 
By induction, we can generalize to N periods 
DA G) (Pa) — pr)" max(0, (1 + ui + d)N-IS — K) 
Call = i (9.21) 


CESS 


Moreover, expression max(.) can be eliminated by finding the smallest integer value a such 
that: 


Luda S >K, (9.22) 
that is 
a > In(K /(S(1 +.d)"))/In((1 + u)/ + d)), (9.23) 
in this case 


S a (Pa) F: py IG + u) (1 + dDNTIS = K) 


(1 +r)" 
For a big value of N, the formula converges to the Black-Scholes formula that we study later in 


this chapter. Now, in order to code these formulas in MATLAB, we use MATLAB’s functions 
ceilQ) and nchoosek(). 


Call = 


(9.24) 


function Rep=Binomial (N,u,d,r,S0,Strike,T) 
$Function using binomial trees in order to compute 
$a call option’s value. 

SN: Number of periods 

Su: Increasing rate 

$d: Decreasing rate 

r: Risk free interest rate 

%S0: Stock’s initial value 

$Strike: Option’s strike price 

$T: Maturity 


sComputation of the interest rate per period 
InterestRate=(1+r)*(T/N) -1; 
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SProbability of a up movement 
p= (InterestRate-d) /(u-d); 


a=ceil (log (Strike/ (SO* (1+d) *N) ) / (log((1+u) /(1+d)))); 


sComputation of expected payoffs at maturity 
ExpectedPayoffs=0; 
for j=a:N 
inc=nchoosek (N,j) *p* (j) * (1-p) * (N-3) * ((14+u) * (5) *... 
(1+d) * (N-3) *SO-Strike) ; 
ExpectedPayoffs=ExpectedPayoffs+inc; 


end 
Rep=ExpectedPayoffs/ (1+InterestRate) “N; 


end 


For a put option, Put, we find the largest integer a such that 
(1+u)"+d)*~“S < K, thatis,a < In(K /(S + d)%))/In(1 +u)/+d)), (9.25) 
which gives: 


Jio (*) (pny = PNI (K — (1+ u) (1 + DANTIS) 


Put = 
i (+r 


(9.26) 


Example 1.5 We consider a call option with strike price K = $ 50 on an asset whose 
price today is S = $ 48. The maturity of the option is t = 55 days which corresponds to 
5 = 0.151 year . The risk-free interest rate is r = 3% per year. To value this option, we 
consider a two-period binomial tree, N = 2, with 1 + u = 1.25 and 1 + d = 0.80 per period. 


We first compute the one period interest rate as follows: 
1 +F = (1.03) = (1.03)™5"? = 1.0022. (9.27) 
The option price process is therefore 


Cun = max(0, 1.25? x 48 — 50) = 25 


Cu 
J N 
Call Cug = max(0, 1.25 x 0.8 x 48 — 50) = 0 (9.28) 
Np 
Ca 
N 2 
Caa = max(0, 0.8° x 48 — 50) = 0 
and 
T-d 1.0022 — 0.80 
Pn = = = 0.4493. (9.29) 


u—d 1.25—0.80 
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Thus 
Cu = (PnCuu + (1 = Pn)Cua)/ +7) = $ 11.21 (9.30) 
Ca = (PnCua + (1 = pn)Caa)/ +7) = $0 (9.31) 

and 
Call = (pnCy + (1 — pn)Ca)/A +P) = $ 5.02. (9.32) 


9.2 BLACK AND SCHOLES (1973) AND MERTON (1973) OPTION 
PRICING MODEL 


In this section, we derive the fundamental equation to value contingent claims also known 
as the Black-Scholes-Merton partial differential equation which represents the state variables 
approach. 


9.2.1 Fundamental Equation for the Valuation of Contingent Claims 


Consider an asset S whose price follows a geometric Brownian motion process as follows: 
dS(t) = wS(t)dt + o S(t)dW (t). (9.33) 


In this stochastic differential equation, u represents the expected instantaneous rate of return 
of the asset, o the instantaneous volatility of its returns, and W is a standard Wiener process. 

We consider a contingent claim on asset S, which means that the asset S is the underlying 
asset. The price of the contingent claim on underlying S at date f is denoted by P(S(d), t). We 
also suppose that the maturity of this contract is T, thus the terminal value of the contingent 
claim is P(S(T), T). 

If the contingent claim is a European call option with maturity T and strike price K, then 
its terminal value is 


P(S(T), T) = max(0, S(T) — K), (9.34) 
however, if it is a European put option with same maturity and strike price, its value will be 
P(S(T), T) = max(0, K — S(T)). (9.35) 
From Ito’s lemma, the contingent claim price’s process, P(S(t), t), can be expressed as: 


dP(S(t),t) = ore Paseo + PEOD a 
1 PSA, 3) 
E 2 S(tydt 
2 se t) aP(S(t), t) 
os) EA PREON 


S(t 
EP + usS(t) 7 


1 a? 
+50°S( 


dP(S(t), t) 
as 


P(S(t), t) 
352 Jar 


o S(t)dW (t). (9.36) 
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From this stochastic differential equation, the expected instantaneous rate of return and the 
instantaneous volatility of the returns of the contingent claim are: 


E| 

g = HL 
aP(S(t), t) OP(S(t),t) 1, 50° P(S(t), t) 

coop Ngee ae 0.37) 
P ; . 

aP(S(t), Desa) 

op= =N ——— (9.38) 

P 
Which allows us to rewrite the process of P(S(t), t) as 

dP(S(t), t) = wp P (S(t), t)dt + op P (S(t), thdW(t). (9.39) 


We now form a portfolio with w fraction of the contingent claim and (1 — @) in asset S. Let 
us call V the value of this portfolio. The return of this specific portfolio is: 


dV(t) dP(S(t), t) dS(t) 
=o +(1-—o) 
V(t) P(S), t) S(t) 
= [oup + (1 — w)u]dt + [wop + (1 — wo ld W (t). (9.40) 


The return of the portfolio would be instantaneously risk-free if the diffusion term was zero, 
that is, if the coefficient of dW was equal to zero. This implies 


wop+(1-—w)o =0 (9.41) 


or also 
o 


(9.42) 


w = . 
O — Op 


Since the portfolio is risk free, its expected return must be equal to the risk-free interest rate 
r, SO 
dV(t) _ 
V(t) 
Comparing this expression to equation (9.40), and using the expression for w given by equation 
(9.42) above, we get 


rdt. (9.43) 


x 


= orp +1—o)u 
oO 


Op 
= Mp 
O — Op O — Op 
= Ee (9.44) 
O — Op 
or also 
r(0 —op)=OCup—oph = O(up—r)=op(u—r). (9.45) 
Thus 
an aye oc ead (9.46) 
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This is the market price of risk. It means that the risk premiums divided by their risk levels, 
known as Sharpe ratios, for the underlying asset and the contingent claim are equal. Note that 


jie (= —\o =h (9.47) 


o 
which means that the market price of risk allows us to adjust the real return to obtain the 
risk-free return in the risk-neutral world. 

Substituting equations (9.37) and (9.38) into equation (9.46), we get 


1 2020P aP aP . 5 
20 S g + eS Ze t 5, rP _ pr 


f 9.48 
Eso - (9.48) 
Rewriting the equation and simplifying gives 
1 323P dP oP 
S S rP =0. 9.49 
ee a oer mon 


This last equation (9.49) is the celebrated Black-Scholes-Merton partial differential equation 
and represents the fundamental equation for the valuation of contingent claims with terminal 
condition P(S(T), T). 

This is an example of a heat equation well known in physical sciences. Under specific 
boundary conditions, it is possible to solve it yielding a solution in exact analytical form. This 
is the case, for example, for the European call and put options. 


9.2.2 Exact Analytical Value of European Call and Put Options 


For a call option on the underlying asset S with strike price K and maturity T, the solution to 
equation (9.49) with boundary condition 


P(S(T), T) = max(0, S(T) — K), (9.50) 
gives the following formula for the price of the call option 
Call = S(0)N (d1) — Ke~"" N (dp), (9.51) 


with 


In(S(0)/K)+rT 1 
= VT 
en 


where N(.) is the normal cumulative distribution function, o the instantaneous volatility of the 
underlying asset returns and r is the risk-free interest rate. 

Although MATLAB’s Financial Toolbox contains valuation functions for options and sensi- 
tivity coefficients in the Black-Scholes world, for pedagogical purposes, the MATLAB program 
used to compute the price of an European call option using the Black-Scholes formula is: 


dı d =d; —oVT, (9.52) 


function rep=CallBS(S0,K,T) 

$Function computing the price of a call option using 
SBlack-Scholes analytical formula 

$S0: Initial price of the underlying asset 

SK: Strike price of the option 

$T: Maturity of the option 
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SParameters initial values 
sigma=0.1; 

r=0.06; 

div=0.00; %Dividend rate 


sComputation of di’s 
d1=(log(S0/K) +(r-div) *T) /(sigma*sqrt (T) )+sigma*sqrt (T) /2; 
d2=d1-sigma*sqrt (T); 


rep=S0*exp (-div*T) *normcdf (d1) -K*exp (-r*T) *normcdf (d2); 


end 


For a put option on the underlying asset S with strike price K and maturity T, the boundary 
condition is 


P(S(T), T) = max(0, K — S(T)). (9.53) 
The solution to equation (9.49) gives rise to the analytical formula for the put option price 
Put = Ke~’? N(—d>) — S(O)N(—d)), (9.54) 


with dı and d, defined above. 
The MATLAB program to compute the price of a European put option using the Black and 
Scholes formula is: 


function rep=PutBS(S0,K,T) 

sFunction computing the price of a put option using 
SBlack-Scholes analytical formula 

%S0: Underlying asset initial price 

%K: Option’s strike price 

$T: Option’s maturity 


SParameters initial values 
sigma=0.2; 

r=0.05; 

div=0.00; %*Dividend rate 


SComputation of di's 
d1=(log(S0/K) +(r-div) *T) /(sigma*sqrt (T) )+sigma*sqrt (T) /2; 
d2=d1-sigma*sqrt (T); 


rep=K*exp (-r*T) *normcdf (-d2) -S0*exp (-div*T) *normcdf (-d1) ; 


end 


Example 2.1 Let a European call on the underlying asset S have a strike price K = $ 50 
and a maturity of 4 months. Thus, in years, T = 4/12 year. Today’s price of the underlying 
asset is S(O) = $ 60 and its volatility is ø = 0.375. The risk-free interest rate is r = 3% per 


year. 
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With these data, we compute dı = 0.9965 and dz = 0.9965 — 0.375,/4/12 = 0.7800. 
This implies N(d,) = 0.8405 and N(d2) = 0.7823, and finally the call and put options’ 
prices are given by the Black and Scholes formula: 


Call = 60 x 0.8405 — 50 x e79-3*@/I2) x 0.7823 


= $ 11.7038, (9.55) 
Put = 50 x e7412 x (1 — 0.7823) — 60 x (1 — 0.8405) 
= $ 1.2067. (9.56) 


Remark 2.2 It is important to notice that the underlying asset’s expected rate of return ju 
doesn’t affect the value of the options in the Black and Scholes formula. 


Property 2.3 European options put-call parity: There exists a relation between the call and 
put option prices called the put-call parity. 

One can show it by constructing the following portfolio: buy one share of the stock and 
one European put option and sell one European call option. The two options have the same 
maturity T and the same underlying stock and strike price K. 

At maturity, we have the following two cases: either S < K or S > K. When S < K, the long 
Stock gives the ex-dividend price (price just after dividends are announced) of the stock S plus 
the dividend received over the period, the long put gives K — S and the short call gives 0, 
which sums up to K plus the dividend. 

When S > K, the long stock gives S plus the dividend received over the period, the long put 
gives 0 and the short call gives —(S — K), which sums up to K plus the dividend. In either case, 
the total payoff at date T is K plus the dividend. The law of one price requires the portfolio 
value today to be equal to the present value of K and the dividend, which yields the put-call 
parity relationship. This relation is defined as follows: 


Put — Call + S = Div + Ke’, (9.57) 


where Div is the discounted amount of dividend paid by the underlying asset. 


9.2.3 Hedging Ratios and the Sensitivity Coefficients 


In hedging and pricing derivatives or contingent claims, the concept of Greeks or derivatives’ 
risk factors is fundamental. Below we summarize some of these risk factors, also called Greeks. 
Note that these Greeks are computed using the Black and Scholes formula by applying the 
mathematical rules of partial differentiation of integrals or Leibnitz’s rule. 
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DELTA 


The hedge ratio DELTA measures the variation in the option’s price with respect to a movement 
in the underlying asset’s price. It can be computed as: 


dCall 
A= ag N(d,) > 0, (9.58) 


where 


_ n(S/K)+rT 1 
di = F + 5ovT. (9.59) 


A portfolio containing a short proportion —1/A in the call option and a long position in one 
unit of the underlying asset gives a hedged portfolio, that is, a risk-free portfolio. 
Similarly, the put option’s hedge ratio DELTA is 


Rae E, (9.60) 
age PVE 


Example 2.4 We consider a call option and a put option on a stock S. These two options 
have the same strike price and the same maturity. Let a portfolio comprise of A stocks, a 
number Acan of call options and a number Apu of put options, that is 


V = A x S + Aca X Call + Apu x Put, (9.61) 


where S, Call and Put are the prices of the stock, the call option and the put option. The 
position A of portfolio V is 


_ av 


~ as 
AK dCall JaN Put 
= all X ——— ut X = 
Call aS Put 3S 


= A + Acan x N(d1) + Apu x (N (d1) — 1). (9.62) 


Numerically, let 100 stocks (A = 100) and N(d1) = 0.7437, then how many call options 
are needed in the portfolio in order to have A = 0 if we assume that there is no put option 
in the portfolio (Apu = 0)? In this case, 


A 


A =0= 100+ AcaN(di) = 100 =—AcuN (dı). (9.63) 
Which implies 
—100 
Acal = ———— = —134.46 9.64 
c 0.7437 eee) 


call options (short positions or short sales of call options). 
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VEGA 


VEGA measures the sensitivity of the option’s price to a movement in the volatility of the 
underlying asset’s returns. It is equal to 
dCall  dPut 
V= Zo SVT N' (dı) > 0, (9.65) 


ðo o 


where 


— In(S/K)+rT 1 
d= T + 50VT (9.66) 


and N’(.) is the first derivative of N(.). 


GAMMA 


GAMMA measures the sensitivity of the option’s DELTA to a variation in the asset price. It 
is the second derivative of the option’s price with respect to the price of the underlying asset 
and is computed as 


d?Call  a°Put  N'(dı) 
3S? aS? So JT 


> 0, (9.67) 


where 
In(S/K T 1 
grae bee T 
oVvT 2 


and N’(.) is still the first derivative of N(.). 
With MATLAB, these coefficients are computed as follows: 


(9.68) 


function Greeks 
$Function computing the sensitivity coefficients for 
$a call option 


sParameters’ initial values 


r=0.03; 
Strike=50; 
S0O=60; 
sigma=0.375; 
T=4/12; 


sComputation of di's 
di=(log(S0/Strike) + (r+sigma*2/2)*T)/(sigma*sqrt (T) ) ; 
d2=d1-sigma*sqrt (T); 


$Computation of the coefficients 
Delta=normcdf (d1) 
Vega=S0*sqrt (T) *normpdf (d1) 
Gamma=normpdf (d1) / (SO*sigma*sqrt (T) ) 


end 
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we find: 
dı = 0.9965 
A = 0.8405 
y = 8.4110 
Tr = 0.0187. 


Example 2.5 If we take the same parameters as in Example 2.1, then, for the call option, 


The other sensitivity coefficients 


In addition to the three coefficients above, we can compute the sensitivity of the option’s price 


to the strike price (KAPPA x): 


dCall ; 
ae —e™T N(do) < 0, 
the sensitivity of the option’s price to the passage of time (THETA @) 


ge Ena ee NSO 
— =S———— > VU, 
aT JT 1 e 2 


and the sensitivity to the risk-free interest rate (RHO p) 


aCall 
pe >. —TKe~™ N(d) > 0, 
g 


where 


In(S/K ‘T 1 
Se Ee ee, 
oVT 2 
With MATLAB, these coefficients are computed as follows: 


dı 


function Greeks2 
%Function computing the coefficients Kappa, Theta, Rho 
sfor a call option 


%Parameters’ initial values 


r=0.03; 
Strike=50; 
S0=60; 
sigma=0.375; 
T=4/12; 


%Computation of di’s 
d1=(log(S0/Strike) + (r+sigma*2/2)*T)/(sigma*sqrt (T)); 
d2=d1-sigma*sqrt (T); 


sComputation of the sensitivity coefficients 
Kappa=-exp (-r*T) *normcdf (d2) 


d =d,—cavT. 


(9.69) 


(9.70) 


(9.71) 


(9.72) 


Theta=S0*sigma/ (2*sqrt (T) ) *normpdf (d1)+Strike*exp (-r*T) *normcdf (d2) 


Rho=T*Strike*exp (-r*T) *normcdf (d2) 


end 
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Example 2.6 For the same parameters as those used in Example 2.1, for the call option, 
we find: 
dı = 0.9965 
dy = 0.7800 
dCall 
—— = —0.7745 
OK 
dCall 
= 43.4578 
oT 
dCall 
“= 12.9089. 
or 


9.3 DERIVATION OF THE BLACK-SCHOLES FORMULA USING 
THE RISK-NEUTRAL VALUATION PRINCIPLE 


This section characterizes the probabilistic approach to the valuation of contingent claims 
which seeks to solve systems of stochastic differential equations. This approach is also known 
as risk-neutralized or equivalent martingale. 


9.3.1 The Girsanov Theorem and the Risk-Neutral Probability 


Theorem 3.1 (Girsanov) Let W(t) be a Brownian motion vector having dimension d in the 
probability space 


(Q,F, P), (9.73) 


where Q is the sample space of results, F is the set of all possible events and P is the set of 
probability measures. 

Let qt) = (qi), q2(Ċ), - - + , qa(t)) be a vector of adapted processes (we have discussed this 
concept in Chapter 8) satisfying 


T 
f (qit) dt <œ Yi as. (9.74) 
0 


We define 


d t 1 t 
M,(t) = exp (> [ sowo- | oas). 0.75) 
i=1 


If M(t) is a martingale, then the process 
dW (t) = dW (t) — q(t)dt (9.76) 


is a Brownian motion of dimension d defined on the probability space (Q, F, Q) where Q is 
the equivalent probability given by dQ = M,dP. 


By virtue of this theorem, if S is a diffusion process defined by 
dS(t) = w(S(t), dt +a0(S(t), t)dAW (t), (9.77) 


General Approach to the Valuation of Contingent Claims 165 


where u and o satisfy Lipschitz’s continuity condition, then 
dS(t) = (u(S(t), t) + o (S(t), Og (t))dt + o (S(t), t)dW (t). (9.78) 
More specifically, assuming that S follows a geometric Brownian motion 
dS(t) = wS(tH)dt + o S(t)dW (t), (9.79) 
if we make the assumption that the risk-free interest rate r is constant, we can choose 


ga (9.80) 


o 


In this case, we can easily show that the process 


M,(t) = exp (J qdW (t) — zS gar) (9.81) 


is a martingale. From Girsanov’s Theorem 3.1 above, the process 


dW, = dW, — qdt 


= dW, - — Ë 


dt (9.82) 


is a Wiener process under the probability measure Q. The process S is thus 


r-u 
Oo 


dS(t) = (uso +oS(t) 


=rS(t)dt + osd A). (9.83) 


) dt + aS(t)dWit) 


The probability Q is called the risk-neutral probability. Under this measure, the asset has 
an expected rate of return equivalent to the risk-free rate. More generally, in a risk-neutralized 
world, the expected return of any asset is the risk-free rate r. 


9.3.2 Derivation of the Black and Scholes Formula Under The Risk Neutralized 
or Equivalent Martingale Principle 


Under the risk-neutral probability, the asset’s price can be rewritten under the form 


S(T) = SO DTW), (9.84) 


We consider a European call option on the underlying asset S with strike price K and maturity 
T. Under the risk neutralized principle, the option’s price is 


Call = E2[e~"? max(0, S(T) — K)], (9.85) 


where E2[.] is the expectation operator under the risk-neutral probability Q. 
To obtain the analytical formula for the option price, we compute this expectation which is 
in fact the computation of an integral. 


Call = E [e™"" max(0, S(T) — K)] 
=e f (S(T) — K) fsery(s)as, (9.86) 
K 
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where f‘s(7)(.) is the probability density function of S(T) under the risk-neutral probability. 
For the S(T) process, since Wr is a Wiener process, we can rewrite it as JTX where X 


follows a Gaussian law with zero mean and unit variance, N (0, 1). 
With this transformation, the call price is: 


CO 32 
Call = e`" f (SO)e"—DT+°VTX _ K) fy(x)dx, 
dy 


where dž is obtained by: 
, o2 MT. 
K < S(T) < œ => K < S(je""DTtHVTX < o, 
which implies 


In(K /S(0)) —rT + ŻZ 
pin MEISOVS TEE 2 oes, 


oVT 


Now, set Y = —X, this way, 


-œ < Y 
In(S(0)/K)+rT — SE 
< oT 
_ In(S(0)/K)+rT oo VT 
= IF 5 


= d2. 


And since Y and X have the same distribution, then 


2 d d 
Call =e S(O)e"- FF / eD fy (y)dy — eT K i fr(y)dy 
—oo —0o 
-rT @-Ż)T 5 -oVTy —rT 
= eT s(Qe"-5 f eT) fy (y)dy — eT KN (dh). 
—00 


To arrive at the final formula, we must compute the following integral: 


dy 
I= f e797) fy (y)dy 


[o6] 
1 n inin 
=> e ov %e Yl dy. 
V2 Jo 
We can write this integral as: 


1 f ie iy es 

I= ete TY /2+0 T/2q : 

V2 Joo á 
Making the following change of variable z = y + 0 JT, then 


-œ <Z— 0 T < d => —œ < z < d +0ov4T = dı 


(9.87) 


(9.88) 


(9.89) 


(9.90) 


(9.91) 


(9.92) 


(9.93) 


(9.94) 
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which implies 


I 1 f -2?/2q o?T/2 
= ——— r ze 
V2 Joo 
=e? TPN). (9.95) 
Substituting this expression in the equation for the option’s price, we obtain the desired formula 


Call = S(O)N (d1) — e "T K N (d2), (9.96) 


where 
ln(S(0)/K)+rT 1 
dı = +-ovT, 
i oVT 2 


This technique of computing the value of a contingent claim using a change of variable is 
very useful when working with a single underlying derivative depending only on one state 
variable. For contingent claims depending on two state variables, we must work with the 
bivariate normal density. However, most common problems in finance are multivariate and 
necessitate the use of the probabilistic approach or equivalent martingale approach described 
above. The use of Monte Carlo simulation techniques is probably the best tool to implement 
the probabilistic or equivalent martingale approaches. We extensively use the Monte Carlo 
method in the following chapters to value complex contingent claims. 


d =d,—aVT. (9.97) 


Notes and Complementary Readings 


For a complementary review of literature about the valuation of contingent claims, see Bajeux- 
Besnainou and Portait (1992), Bjork (1999), Bryis, Bellalah, Mai and de Varenne (1998), 
Clewlow and Strickland (1998), Dana and Jeanblanc-Picqué (2003), Duffie (2001), Haug 
(1998), Hull (2005), Luenberger (1998), McDonald (2003), Merton (1992), Musiela and 
Rutkowski (1997), Neftci (2000), Quittard-Pinon (2003), Taleb (1997), Wilmott (1998). 

It is important to note that there exists a vast literature on the valuation of contingent claims 
using the numerical approach of solving partial differential equations. For this methodology, 
the reader could consult Seydel (2002), Tavella (2002) and Wilmott (1998). 

The reader could also consult the following classic articles: Black and Scholes (1973), Cox 
and Ross (1976), Cox, Ross and Rubinstein (1979), Géman, El Karoui and Rochet (1995), 
Harrison and Kreps (1979), Harrison and Pliska (1981), Merton (1973). 

For a pedagogic introduction to the risk-neutral pricing or equivalent martingale concept, 
one can read Sundaram (1997), Baxter and Rennie (1996), and Chapter 25 of Hull (2005). 

For extensions of the basic Black-Scholes-Merton (1973) model of valuing options to 
multivariate forms, one can read: Margrabe (1978), Stultz (1982) for complex options, Fouque, 
Papanicolaou and Sircar (2000), Heston (1993), Hofman, Platen and Schweizer (1992), Hull 
and White (1987), Romano and Touzi (1997) for options with stochastic volatility, Bensaid, 
Lesne, Pagés and Scheinkman (1992), Hoggard, Whalley and Wilmott (1994), Leland (1985) 
for the valuation of options with transaction costs. 
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Pricing Options using Monte Carlo 


Simulations 


Since the seminal works of Black-Scholes-Merton (1973) on European options valuation, a 
great number of derivative products have been engineered. However, in most cases it is difficult 
to find closed form formulas for the pricing of these commonly encountered complex options. 
Monte Carlo simulation techniques have proved to be a flexible and handy alternative to price 
these options. 

In this chapter, we proceed as follows. First, we start by valuing simple options called plain 
vanilla options, and then we introduce stochastic interest rates and/or stochastic volatilities. 

Second, we estimate the value of American options using the Least-Square Linear Re- 
gression Method of Longstaff and Schwartz (2001) and the Dynamic Programming Technique 
with Stratified States Aggregation of Barraquand and Martineau (1995). 

Third, we value Asian and barrier options using the approaches developed by Kemna and 
Vorst (1990) and El Babsiri and Noel (1998). Finally, to estimate the sensitivity coefficients of 
options or the Greeks, we discuss the pathwise derivatives and the likelihood ratio techniques 
proposed by Broadie and Glasserman (1996) and present the retrieval of volatility method of 
Cvitanic, Goukassian and Zapatero (2002). 


10.1 PLAIN VANILLA OPTIONS: EUROPEAN PUT AND CALL 
10.1.1 Simple Simulations 


We assume that the asset S(t) follows the stochastic differential equation (Geometric Brownian 
Motion) we have studied in Chapter 8 under the risk-neutral probability: 


dS(t) =r Sdt +o S(t)dW(t), (10.1) 


where W is the Brownian motion under the risk-neutral probability. We will simulate 10 batches 
of 5000 paths each (NbTraj = 5000) to price a European put as well as a call. The option 
value corresponds to the average value of its discounted future payoffs under the risk-neutral 
probability. We will therefore reproduce the dynamics of future prices of the underlying asset 
using computers, and calculate next the future payoffs to be obtained by the option holder. 
The sample mean of these discounted payoffs is the value of the option contract. We use 
constant interest rate r = 0.05 and volatility ø = 0.2. In addition, we assume the underlying 
asset pays no dividend. Let’s show how to simulate the asset process S(t) in MATLAB and 
determine the value of European call and put options. To simulate a batch, we proceed as 
follows: 


function OptionMC(S0O,K,T) 
$Function to calculate the prices of European Put and Call options 
$using Monte Carlo simulations. 
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$S0: Initial price of the asset 
$K: Exercise price of the options 
$T: Time to maturity of the options 


$Initial parameters 
sigma=0.2; 

r=0 057 
NbTraj=5000; 
NbPas=100; 
DeltaT=T/NbPas; 


SVector of asset prices 
SPresent=S0sones (NbTraj,1) ; 
SNext=zeros (NbTraj,1); 


SLoop to simulate the paths 

for i=1:NbPas 
dW=sqrt (DeltaT) »randn(NbTraj,1) ; 
SNext=SPresent+ (r) «SPresent«DeltaT+sigmaxSPresent.+dW; 
SPresent=SNext ; 

end 


SCalculation of the options prices 
Call = exp(-r«T) «mean (max(0,SPresent-K) ) ; 
Put = exp(-r«T) «mean (max(0,K-SPresent) ) ; 


end 


We executed the program for the 10 batches using 5000 paths (NbTraj = 5000) and 
100 time steps (VbPas = 100). Tables 10.1 and 10.2 present the results. 

In the tables, BS denotes the prices obtained from the Black-Scholes formula discussed in 
Chapter 9. The errors shown in the tables are the standard deviation of the vector of the 10 
batches. These values are large. We could improve the simulations’ accuracy by performing 
more simulations. However, as we discussed earlier, other variance reduction techniques can 
be used. The results obtained using antithetic and control variables follow. 


Table 10.1 Price of a call option using simple simulations 


Call option 

Parameters BS Monte Carlo 
S(O) K T Price Price Error 

$ $ years $ $ $ 
90 100 1 5.09 5.04 0.15 
90 100 2 9.91 9.95 0.26 
100 100 1 10.45 10.41 0.15 
100 100 2 16.13 16.10 0.38 
110 100 1 17.66 17.66 0.23 
110 100 2 23.59 23.45 0.32 


Pricing Options using Monte Carlo Simulations 171 


Table 10.2 Price of a put option using simple simulations 


Put option 

Parameters BS Monte Carlo 
S(O) K T Price Price Error 

$ $ years $ $ $ 
90 100 1 10.21 10.28 0.11 
90 100 2 10.39 10.42 0.12 
100 100 1 5.57 5.60 0.12 
100 100 2 6.61 6.58 0.17 
110 100 1 2.79 2.74 0.08 
110 100 2 4.07 4.10 0.11 


10.1.2 Simulations with Antithetic Variables 


We will simulate again the process for S(t) with 10000 paths (5000 + 5000 antithetic) for 
each batch. For each generated W(t), we use —W (t) to obtain the second path. We slightly 
modified the previous program to incorporate this variance reduction technique. 


function OptionMCAnti(S0O,K,T) 

$Function to calculate the prices of European Put and Call options 
S$using Monte Carlo simulations with antithetic variables. 

$S0: Initial price of the asset 

SK: Exercise price of the options 

$T: Time to maturity of the options 


$Initial parameters 
sigma=0.2; 

r=0.05; 
NbTraj=5000; 
NbPas=100; 
DeltaT=T/NbPas; 


%Price vector of the asset 
SPresent=S0sones (2%*NbTraj,1); 
SNext=zeros (2*#NbTraj,1) ; 


SLoop to simulate the paths 

for i=1:NbPas 
temp=sqrt (DeltaT) *randn (NbTraj,1); 
dW= [temp;-temp] ; 
SNext=SPresent+ (r) «SPresent*DeltaT+sigmaszSPresent.+dW; 
SPresent=SNext; 

end 


$Calculation of the options prices 
Call = exp(-r+«T) «mean (max(0,SPresent-K) ) ; 


Put = exp(-r«T) «mean (max(0,K-SPresent) ) ; 


end 
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Table 10.3 Price of a European call option using antithetic 


variables in the simulations (MC Anti.) 


Call option 


Parameters BS MC Anti. 

S(O) K T Price Price Error 

$ $ years $ $ $ 

90 100 1 5.09 5.13 0.10 

90 100 2 9.91 9.86 0.17 
100 100 il 10.45 10.46 0.10 
100 100 2 16.13 16.10 0.20 
110 100 1 17.66 17.66 0.11 
110 100 2 23.59 23.60 0.17 


Table 10.4 Price of a European put option using antithetic 


variables in the simulations (MC Anti.) 


Put option 
Parameters BS MC Anti. 

S(O) K T Price Price Error 

$ $ years $ $ $ 

90 100 1 10.21 10.21 0.07 

90 100 2 10.39 10.40 0.09 
100 100 1 5.57 5.57 0.06 
100 100 2 6.61 6.59 0.06 
110 100 1 2.79 2.80 0.04 
110 100 2 4.07 4.10 0.08 


Tables 10.3 and 10.4 present the simulation results obtained using this program. The error 


is again measured by the standard deviation of the 10 batches. 


We observe from the tables that the precision of the results has been improved considerably. 
The prices obtained from the Monte Carlo simulations are more precise and the errors are 


smaller. 


10.1.3 Simulations with Control Variates 


Let’s now use control variates to increase the precision of the results. Using the above notation, 


we know that 


Y = S(T) 
E[Y] = SOeT. 
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We can then use the difference Y — E[Y] to improve our results. The program is a modified 
version of the Monte Carlo simulation program given previously. 


function OptionMCControl (S0O,K,T) 

S$Function to calculate the prices of European Put and Call options 
S$using Monte Carlo simulations with control variates. 

%S0: Initial price of the underlying asset 

SK: Exercise price of the options 

$T: Time to maturity of the options 


$Initial parameters 
sigma=0.2; 

r=0.05; 
NbTraj=5000; 
NbPas=100; 
DeltaT=T/NbPas; 


%Price vector of the asset 
SPresent=S0sones (NbTraj,1) ; 
SNext=zeros (NbTraj,1); 


$Loop to simulate the paths 

for i=1:NbPas 
dW=sqrt (DeltaT) *«randn(NbTraj,1); 
SNext=SPresent+ (r) «SPresentsDeltaT+sigmaszSPresent.+dW; 
SPresent=SNext ; 

end 


$Using the control variates technique for the call option 
X1l=exp (-r«T) «max (0,SPresent-K) ; 

EX1=mean (X1); 

Y=SPresent; 

EY=SO«exp (rT); 


cov1=((X1-EX1)'* (Y-EY))/ (NbTraj-1) ; 
var=((Y-EY)’«* (Y-EY) ) / (NbTraj-1) ; 
alphal=-covl1/var; 


$Calculation of the price of the call option 
PriceControll1=X1l+alphals (Y-EY) ; 
repCall=sum(PriceControl1) / (NbTraj) ; 


sUsing the control variates technique for the put option 
X2=exp (-r«T) «max(0,K-SPresent) ; 

EX2=mean (X2); 

Y=SPresent; 

EY=SO«exp (rT); 


cov2=((X2-EX2)'«(Y-EY))/(NbTraj-1) ; 
alpha2=-cov2/var; 
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Calculation of the price of the put option 
PriceControl2=X2+alpha2« (Y-EY) ; 
repPut=sum(PriceControl2) / (NbTraj) ; 


end 


We can now run our simulations for European call and put options using the control variates 
technique. Tables 10.5 and 10.6 present the simulations results. 
We observe that the prices obtained with Monte Carlo simulations are more precise and 
that the errors have been reduced. The choice of the control variables could be different. 
For example, we could use the sensitivity coefficients of options (the Greeks) as in Clewlow 
and Strickland (1997a). If one wishes to use the option sensitivity coefficient A, the control 


variable will look like 


NbPas— 


' 9P(T —t;, SD) 
2 “=~ eee 


AS(ti), 


Table 10.5 Price of a European call option using control variates 


in the simulations 


Call option 


Parameters BS MC (Control) 
S(O) K T Price Price Error 

$ $ years $ $ $ 
90 100 1 5.09 5.06 0.06 
90 100 2 9.91 9.95 0.10 
100 100 1 10.45 10.42 0.08 
100 100 2 16.13 16.15 0.10 
110 100 1 17.66 17.64 0.07 
110 100 2 23.59 23.58 0.10 


Table 10.6 Price of a European put option using control variates 


in the simulations 


Put option 

Parameters BS MC (Control) 
S(O) K T Price Price Error 

$ $ years $ $ $ 
90 100 1 10.21 10.20 0.06 
90 100 2 10.39 10.45 0.09 
100 100 1 5:57 5.54 0.08 
100 100 2 6.61 6.62 0.09 
110 100 1 2.79 2.81 0.08 
110 100 2 4.07 4.06 0.07 


(10.4) 
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where T — ti = T; represents the time to maturity at the i" step. Replacing the differential and 
subtracting the expectation, we obtain for a call option: 


NbPas—1 
Y-EYY]= Yo NAT — ti, Sto StAdW (t), (10.5) 
i=0 
and for a put option: 
NbPas—1 
Y-EY]= D> WGC — t, S- Do SAd t), (10.6) 


i=0 


where dı(., .) is an explicit function of the time to maturity and the asset price S. Here is the 
MATLAB program for the simulations using these control variates. 


function OptionMCControl2(S0,K,T) 

S$Function to calculate the prices of European Put and Call options 
Ssusing Monte Carlo simulations with control variates. 

sWe use the value of the coefficient Delta as control variable. 
%S0: Initial price of the asset 

SK: Exercise price of the options 

$T: Time to maturity of the options 


$Initial parameters 
sigma=0.2; 

r=0.05; 
NbTraj=5000; 
NbPas=100; 

Tau=T; 
DeltaT=T/NbPas; 


%Price vector of the asset 
SPresent=S0+ones (NbTraj,1) ; 
SNext=zeros (NbTraj,1) ; 


Vector of the control variables 
Yl=zeros (NbTraj,1) ; 
Y2=zeros (NbTraj,1) ; 


SLoop to simulate the paths of the asset and the control variables 
for i=1:NbPas 
dW=sqrt (DeltaT) *«randn(NbTraj,1); 
if i<NbPas 
Tau=Tau-DeltaT; 
di= (log (SPresent/K) + (r+sigma*2/2)*Tau) / (sigmasxsqrt (Tau) ) ; 
Y1=Y1+ (normcdf (d1)) .*«SPresent .«dWs*sigma; 
Y2=Y2+ (normcdf (d1)-1) .*«SPresent .*«dWsxsigma; 
end 
SNext=SPresent+ (r) «SPresent*DeltaT+sigmaszSPresent.+dW; 
SPresent=SNext ; 
end 
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%Using the control variates technique for the call option 
X1l=exp (-r«*T) «max(0,SPresent-K) ; 

EX1=sum (X1) /NbTraj; 

EY1=0; 


covl=((X1-EX1) '« (Y1-EY1)) / (NbTraj-1); 
varl=((Y1-EY1)’* (Y1-EY1) )/(NbTraj-1) ; 
alphal=-covl1/varl; 


xI 


$Calculation of the price of the call option 
PrixControl1=X1+alphals« (Y1-EY1) ; 
repCall=sum(PrixControl1) / (NbTraj) ; 


$Using the control variates technique for the put option 
X2=exp (-r«*T) *«max(0,K-SPresent) ; 

EX2=sum(X2) /NbTraj; 

EY2=0; 


cov2=( (X2-EX2) '« (Y2-EY2)) /(NbTraj-1); 
var2=((Y2-EY2)’* (Y2-EY2))/(NbTraj-1) ; 
alpha2=-cov2/var2; 


Calculation of the price of the put option 
PrixControl2=X2+alpha2« (Y2-EY2) ; 
repPut=sum(PrixControl2) / (NbTraj) ; 


end 


Tables 10.7 and 10.8 present the simulations results using these new control variables. 

We can clearly see from the tables that these new control variates produce more accurate 
results than Y = S(T). Hence, it is important to choose the appropriate control variates for our 
simulations. It is also possible to use several variance reduction techniques at the same time, 
i.e., combine control variates with antithetic variables or use multiple control variates. 


Table 10.7 Price of a European call option with the second 
control variate 


Call option 

Parameters BS MC (Control) 
S(O) K T Price Price Error 

$ $ years $ $ $ 
90 100 1 5.09 5.10 0.01 
90 100 2 9.91 9.89 0.03 
100 100 1 10.45 10.45 0.02 
100 100 2 16.13 16.11 0.04 
110 100 1 17.66 17.66 0.03 
110 100 2 23.59 23.61 0.06 
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Table 10.8 Price of a European put option with the second 
control variate 


Put option 

Parameters BS MC (Control) 
S(O) K T Price Price Error 

$ years $ $ $ 
90 100 1 10.21 10.21 0.01 
90 100 2 10.39 10.39 0.03 
100 100 1 5.57 5.58 0.01 
100 100 2 6.61 6.60 0.02 
110 100 1 2.79 2.78 0.01 
110 100 2 4.07 4.09 0.03 


10.1.4 Simulations with Stochastic Interest Rate 


In practice, it is often necessary to account for the stochastic nature of interest rates, and several 
interest rate models are available for that end. We discuss in more detail interest rate models in 
Chapter 11. Here, we present two models: Cox, Ingersoll and Ross (1985b) (CIR) and Vasicek 
(1977) (VAS). These two interest rate models are based on the following equation: 


dr(t) = (6 —r(t))dt +0,r(t)”dW,(t). (10.7) 


In the VAS model, y = 0.0 and y = 0.5 in the CIR model. In our simulations, we use the 
following values for the other parameters: 


k = 0.25 
0 = 0.05 
o, = 0.05 
r(0) = 0.05. 
In addition, we assume that 
corr(dW,(t), dWs(t)) = p.s, (10.8) 


where d Ws (t) is the Brownian motion characterizing the diffusion portion of the asset price 
process: 


dS(t) =r(t)S(dt + oS(t)dWs(t). (10.9) 
We need to generate 
~  (dWs 
dW = (aw ) N(O, A), (10.10) 


with 
a=( At a (10.11) 
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In order to do this, we calculate L knowing that A = LL" and simulate dZ ~ N(0, I) to 
obtaindW = LdZ. We have split the option’s time to maturity into 100 time steps. We use the 
quadratic resampling method to generate dZ, which will affect d W by construction. For this 
purpose, we will use the ReQuadratic program proposed in Chapter 5. Let’s define mz and Az 
that are respectively the theoretical mean and covariance matrix of dZ. Hence, we have 


mz = Ge (10.12) 


Az= (4 `) : (10.13) 


We use these variables when generating our stochastic processes. The MATLAB program used 
to simulate European call and put option prices is the following. 


function OptionIntStoch(S0,K,T,gamma) 

$Function to calculate the prices of the European call and put 
options 
$using Monte Carlo simulation and stochastic interest rate. 
%S0: Initial price of the asset 

SK: Exercise price of the options 

$T: Time to maturity of the options 

sgamma: Parameter of the interest rate 


$Initial parameters 
NbPas=100; 
DeltaT=T/NbPas; 
NbTraj=5000; 
sigma=0.2; 


SParameters of the interest rate 
kappa=0.25; 

theta=0.05; 

sigmaR=0.05; 

r0=0.05; 

rho=-0.2; 


SVectors of the interest rate values and the discount rates 
r = r0Ox«ones (NbTraj,1); 
rAct = zeros(NbTraj,1); 


Vector of the asset price 
S = SOxones (NbTraj,1); 


SCholeski Factorization of the covariance matrix 
L = chol([DeltaT,rhosxDeltaT;rhos«DeltaT,DeltaT])’; 


SLoop to simulate the steps 
for cptPas=1:NbPas 
DeltaZ = randn(2,NbTraj) ; 
DeltaZ = ReQuadratic(DeltaZ,zeros(2,1),eye(2)); 
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DeltaW = LxDeltaZ; 
S = S.«(1+rx«DeltaT+sigmasDeltaW(1,:)'); 


r = r+kappas (theta-r) «DeltaT+sigmaRs (r.*~gamma) .*DeltaW(2,: 


rAct = rAct + rx«DeltaT; 
end 
$Calculation of the price of the options 
PriceCall = mean(max(0,S-K) .*exp(-rAct) ) ; 
PricePut = mean (max (0,K-S) .*exp(-rAct)); 
end 


function Rep=ReQuadratic(Sample, MoyTheo, CovTheo) 
$Function performing the quadratic resampling. 

sSample: Simulated sample 

$MoyTheo: Theoretical mean of the variables 

SCovTheo: Theoretical covariance matrix of the variables 


Calculation of the sample distribution 
CovEmp=cov (Sample’) ; 

MoyEmp=mean (Sample, 2) ; 

LEmp=chol (CovEmp) ' ; 


sResampling based on the theoretical covariance matrix 
LTheo=chol (CovTheo) '; 

Sample=LTheos«inv (LEmp) «(Sample-... 

repmat (MoyEmp,1,size(Sample,2)))+... 
repmat (MoyTheo,1,size(Sample, 2) ) ; 


Rep=Sample; 
end 


We simulate 100000 paths (20 batches of 5000 simulations each) and calculate the standard 
deviation of the 20 batches. We use p, s = —0.20 and NbPas = 100. Tables 10.9 and 10.10 


present the simulations results for the CIR and VAS models. 


As in the previous section, the reader can use antithetic and control variables to obtain more 
accurate results. One suggested control variable is the Black-Scholes expression. We leave 


this as an additional practice exercise for the reader. 


Table 10.9 Price of a European call option with stochastic interest rates 


Call option 


Parameters VAS CIR 
S(O) K T Price Error Price Error 
$ $ years $ $ $ $ 
90 100 1 4.96 0.11 5.09 0.08 
90 100 2 9.66 0.12 9.80 0.12 
100 100 1 10.29 0.10 10.42 0.12 
100 100 2 15.84 0.18 16.01 0.16 
110 100 1 17.59 0.08 17.64 0.10 
110 100 2 23.31 0.16 23.50 0.16 
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Table 10.10 Price of a European put option with stochastic interest rates 


Put option 
Parameters VAS CIR 

S(O) K T Price Error Price Error 

$ $ years $ $ $ $ 

90 100 1 10.16 0.07 10.18 0.06 

90 100 2 10.38 0.07 10.32 0.06 
100 100 1 5.49 0.08 5.52 0.06 
100 100 2 6.58 0.08 6.56 0.06 
110 100 1 2.71 0.07 21D 0.07 
110 100 2 4.03 0.07 4.00 0.06 


10.1.5 Simulations with Stochastic Interest Rate and Stochastic Volatility 


In Section 10.1.4, we introduced two stochastic interest rate models. Indeed, the price of 
an asset can depend on many stochastic parameters. Modeling implies many errors such as 
dimensionality errors, measurement errors, etc. All these errors can be captured or subsumed 
by a stochastic volatility process. 

The object here is not to make an extensive treatment of different volatility processes and 
models or to study various related concepts (e.g., implicit volatility, volatility surface, volatility 
derivatives, etc.). Readers interested in these issues will find useful references in the notes and 
complementary readings section at the end of the chapter. 

In this section, we present a stochastic volatility model and use it to price plain vanilla 
options using Monte Carlo simulations. 

The volatility of an asset price will be a function of its value but will also be correlated with 
its return. We also assume a non-zero correlation between the stochastic interest rate and the 
asset volatility. 

The dynamic of the asset volatility is as follows 


dolt? = v(B — o (tdt + 0,0(t)dW,(t), (10.14) 


where dW, is a Wiener process such that 


corr(dW,(t), dWs(t)) = po.s = —0.5, (10.15) 
corr(d Ws (t), dW,(t)) = Por = Pas s.r = 0-1. (10.16) 


In the stochastic volatility equation, v is the adjustment speed, £ the long run mean of the 
process and oo the standard deviation of the process. In order to price European call and put 
options, we use the following parameter values to run our simulations 


09 = 0.20 
v = 1.25 
B = 0.04 


Oo = 0.20. 
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The following MATLAB program is a slightly modified version of the previous program in 
which, in addition of the stochastic interest rate models of CIR and VAS, we have added the 
stochastic volatility for the simulation. 


function rep=OptionIntVolStoch(S0,Strike,T,gamma) 

S$Function to calculate the prices of the European call and put 
options 

$using Monte Carlo simulations with stochastic interest rate 
sand stochastic volatility. 

$S0: Initial asset price 

SK: Exercise price of the options 

$T: Time to maturity of the options 

sgamma: Parameter of the interest rate 


S$Initial parameters 
NbStep=100; 
DeltaT=T/NbStep; 
NbTraj=5000; 
rho=-0.2; 
sParameters of the stochastic interest rate 
kappa=0.25; 
theta=0.05; 
sigmaR=0.05; 
r0=0.05; 
$Parameters of the stochastic volatility 
nus. 25:7 
beta=0.04; 
sigma0=0.2; 
sigmaSigma=0.2; 
rho2=-0.5; 
rho3=rhoxrho2; 
sVectors of spot interest rates and discounted rates 
r = r0xones(NbTraj,1); 
rAct = zeros(NbTraj,1); 
sVectors of asset and volatility values 
S = SOsones (NbTraj,1) 
sigma = sigma0sxones (NbTraj,1) ; 
%Choleski Factorization 
L=chol ( [DeltaT, rhos#DeltaT, rho2s4DeltaT;rho«DeltaT,DeltaT, rho3* 
DeltaT;...rho2«DeltaT,rho3s*DeltaT,DeltaT])'; 
sSimulation of the paths 
for cptpas=1:NbPas 
DeltaZ randn(3,NbTraj) ; 
DeltaZ = ReQuadratic(DeltaZ,zeros(3,1),eye(3)); 
DeltaW = LxDeltaZ; 
S = S.«(1+rs«DeltaT+sigma.«DeltaW(1,:)’); 
r = r+kappas (theta-r) *«DeltaT+sigmaRs« (r.*~gamma) .*«DeltaW(2,:)'; 
sigma = (sigma. ”~2+nux (beta-sigma.*2)+*DeltaT+... 
sigmaSigmasSigma.»*DeltaW(3,:)’).7(0.5); 
rAct = rAct + rx*DeltaT; 
end 
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Table 10.11 Price of a European call option with stochastic interest rate and stochastic volatility 


Call option 


Parameters VAS CIR 
S(O) K T Price Error Price Error 
$ $ years $ $ $ $ 
90 100 1 4.61 0.07 4.69 0.08 
90 100 2 9.37 0.13 9.48 0.17 
100 100 1 10.31 0.08 10.33 0.09 
100 100 2 15.87 0.17 16.10 0.17 
110 100 1 17.79 0.10 17.88 0.08 
110 100 2 23.56 0.11 23.79 0.19 


%Calculation of the prices of the European call and put options 
PrixCall = mean (max (0,S-Strike).xexp(-rAct)); 

PrixPut = mean(max(0,K-Strike) .*exp(-rAct)); 

end 


function Rep=ReQuadratic (Sample, MoyTheo, CovTheo) 
$Function performing the quadratic resampling 
sSample: Simulated sample 
sMoyTheo: Theoretical mean of the variables 
SCovTheo: Theoretical covariance matrix of the variables 
sCalculation of the parameters of the sample distribution 
CovEmp=cov (Sample’) ; 
MoyEmp=mean (Sample, 2) ; 
LEmp=chol (CovEmp) ’ ; 
S$Resampling based on the theoretical covariance matrix 
LTheo=chol (CovTheo) ’; 
Sample=LTheosinv (LEmp) + (Sample-... 
repmat (MoyEmp,1,size(Sample,2)))+... 
repmat (MoyTheo,1,size(Sample,2)); 


Rep=Sample; 
end 


This program is used to estimate European call and put option prices. Tables 10.11 and 10.12 
present the simulations results. The error is measured by the standard deviation of the result 
vector of the 20 batches. 

As in the section above, the reader can use antithetic and control variables to obtain more 
accurate results. One suggested control variable is the Black-Scholes formula. We leave this 
as an additional practice exercise for the reader. 


10.2 AMERICAN OPTIONS 


The feature of an American option is that it can be exercised at any time before its maturity. 
Given the fact that these options offer more flexibility to their holders, their price should be 
higher than the price of their equivalent counterpart European options. However, for a call 
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Table 10.12 Price of a European put option with stochastic interest rate and stochastic volatility 


Put option 
Parameters VAS CIR 

S(O) K T Price Error Price Error 

$ $ years $ $ $ $ 

90 100 1 9.77 0.08 9.81 0.06 

90 100 2 10.06 0.10 9.98 0.09 
100 100 1 5.46 0.06 5.49 0.08 
100 100 2 5.60 0.09 6.55 0.10 
110 100 1 2.98 0.09 3.01 0.07 
110 100 2 4.33 0.13 4.29 0.10 


option on a non-dividend-paying stock, the American and European options are worth the 
same; there’s no advantage in early exercise of a call in this case. 

There exists no simple formula to price American options. One may use binomial trees and 
examine at each node if it is optimal to exercise the option immediately or not. This simple 
approach can take a considerable amount of calculation time to obtain an adequate precision 
for options on several underlying assets. 

We present next two approaches to price American options. These two approaches use 
Monte Carlo simulations instead of binomial trees. The approaches are the Least-Squares 
Method of Longstaff and Schwartz (2001) and the Dynamic Programming Technique with 
Stratified States Aggregation of Barraquand and Martineau (1995). 


10.2.1 Simulations Using The Least-Squares Method of Longstaff 
and Schwartz (2001) 


When dealing with American options, it is not always easy to decide whether to exercise them 
immediately or keep them in the portfolio until their expiration date. The optimal strategy 
would be to exercise the option if the immediate payment is larger than the expected future 
payments, otherwise it should be kept. Let’s assume that the price of the stock is S(t) and the 
payoff function is P (S(t), t). Then the optimal strategy at time f = t4 is 


Yes if P(S(t1), tt) > En [P(S(), t0)|Fr] 


eae 3 , (10.17) 
No if P(S(t), t1) < En [P(S(), 2)1Fi] 


Exercise = | 


where E, [P (S (t2), t2)| F, ] is the expectation of future payments (of time t2 > tı) at time t = 
tı, and F, represents the available information set at t = t4. It is therefore essential to “know” 
the expected value of future payments in order to make an accurate evaluation of an American 
option. The Least-Squares method is a technique that enables us to perform this exercise. 

We start by generating M paths for the underlying stock price S. Next, for each path, we 
need to regress the future payoffs on basis functions F;, which depend on the stock price S. Let 
Y be the vector of future payoffs for the M paths and 1, F,(S) and F2(S) be the basis functions. 
We regress Y on these basis functions, which yields the expression 


E[Y |S] = œ + BF\(S)+ yF2(S). (10.18) 


184 Stochastic Simulation and Applications in Finance 


Table 10.13 Simulation paths 


Number t=0 t=1 f=2 t=3 


1 1.00 1.07 1.53 1.95 
2 1.00 0.76 0.78 0.71 
3 1.00 0.85 0.69 0.76 
4 1.00 0.96 1.01 0.97 
5 1.00 0.95 1.06 1.28 
6 1.00 1.59 1.26 1.07 
7 1.00 1.28 1.23 0.97 
8 


1.00 1.11 1.57 1.89 


This expression gives us an estimation of the expected value of future payoffs as a function of 
S. This expected value is effectively the value of holding on to the option. From this expression, 
we can decide if it is preferable to exercise the option immediately or to wait one more period. 
This procedure is reproduced backward from the maturity date to time t = 0. For each path, 
we find the optimal exercise date of the option. The price of the option is therefore the average 
of all discounted payoffs. 

To better illustrate the method, we use it to price an American put option. We assume 
S(O) = $1.0, the exercise price is K = $ 1.10, the time to maturity of the option is 3 years 
and the option’s potential exercise dates are 1, 2 and 3 years. If S(0) Æ $ 1.0, it is better to 
normalize the initial price to S(0) = $ 1.0 as the exercise price becomes K/S(0) to reduce the 
estimation errors in the regressions. We also assume the risk free interest rate to be r = 6% 
and M = 8 paths for simplification. Table 10.13 presents the simulation values of S. 

To use the Least-Squares Method, we need to start at maturity and go backward until we 
reach the initial time ¢t = 0. At time ¢ = 3, the holder of the option exercises it only if he gains 
from it. The payoffs matrix at time t = 3 is given by Table 10.14. 

This matrix shows the realized payoffs of an equivalent European option. Now we need to 
determine for which paths it is preferable to exercise the option at date t = 2. For t = 2, only 4 
paths must be considered since the other 4 paths would result in zero payoffs. The results are 
presented in Table 10.15 (Y represents the discounted expected payoffs). 

We use 1, S and S? as the basis functions. We therefore need to regress Y on these functions, 
which yields: 


E[Y |S] = —2.71 + 7.81S — 4.968”. (10.19) 


Table 10.14 Payoffs at t = 3 


Number En] AP t=3 


APIADNARWNE 
| 
| 
= 
2 
5 
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Table 10.15 Variables for the first regression 


Number S Y 
1 es a 
2 0.78 0.39*0.9418 = 0.3673 
3 0.69 0.34*0.9418 = 0.3202 
4 1.01 0.13*0.9418 = 0.1224 
5 1.06 0.00*0.9418 = 0.0000 
6 $ L 
7 = = 
8 = = 


To obtain the regression coefficients, we only need to compute the following matrix product 


(X'X)'x'Y = (—2.7077, 7.8084, —4.9566)', (10.20) 
where 
1 0.78 0.787 0.3673 
1 0.69 0.692 
X= > | andY = D0. (10.21) 
1 1.01 1.01 0.1224 
1 1.06 1.062 0 


From the regression equation, we evaluate the function E[YIS] for different values of S at time 
ER 

E[Y|S = 0.78] = 0.3672 

E[Y |S = 0.69] = 0.3202 

E[Y |S = 1.01] = 0.1225 

E[Y |S = 1.06] = —0.0001. 
We then compare these values to the payoffs resulting from an immediate exercise of the 
option (see Table 10.16). 


We note that it is preferable to exercise the option at date 2 for paths 3 and 5. For simulations 
2 and 4, the expected payoffs are higher when the option is not exercised. Then, taking into 


Table 10.16 Exercise decision at time t = 2 


Number Exercise Continuation 


0.32 0.36 
0.41 0.32 
0.09 0.12 
0.04 0.00 


AADNBRWNK 
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Table 10.17 Payoffs at date t = 2 


Number t=1 t=2 t=3 
1 — 0.00 0.00 
2 — 0.00 0.39 
3 — 0.41 0.00 
4 — 0.00 0.13 
5 — 0.04 0.00 
6 — 0.00 0.03 
7 — 0.00 0.13 
8 — 0.00 0.00 


Table 10.18 Variables for the second regression 


Number S Y 

1 1.07 0.00 

2 0.76 0.39*0.9418? = 0.3459 
3 0.85 0.41*0.9418 = 0.3861 
4 0.96 0.13*0.9418? = 0.1153 
5 0.95 0.04*0.9418 = 0.0377 
6 = a 

7 A a 

8 = = 


Table 10.19 Exercise decision at date t = 1 


Number Exercise Continuation 
1 0.03 —0.04 

2 0.34 0.38 

3 0.25 0.26 

4 0.14 0.11 

5 0.15 0.13 

6 = = 

7 = os 

8 = et 


account the payoffs of paths 3 and 5 at time 2, we obtain the following payoff matrix in Table 
10.17. 

We need to follow the same steps to obtain the payoffs at time t = 1. For t = 1, only 3 paths 
provide values of S greater than 1.10 (number 6, 7 and 8). We then need to consider all other 
5 paths in the regression, which yields Table 10.18. 

The regression gives: 


E[Y |S] = 1.38 — 1.285 — 0.0487. (10.22) 


Then we have to compare the immediate payoffs if the option is exercised and the expected 
payoffs if not. Table 10.19 presents the results. 

Here we observe that it is preferable to exercise the option immediately for paths 1, 4 and 5. 
We can therefore complete the option exercise decision table. Table 10.20 presents the results. 
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Table 10.20 Payoffs at t= 1 


Number t=1 t= 2 t=3 
1 0.03 0.00 0.00 
2 0.00 0.00 0.39 
3 0.00 0.41 0.00 
4 0.14 0.00 0.00 
5 0.15 0.00 0.00 
6 0.00 0.00 0.03 
7 0.00 0.00 0.13 
8 0.00 0.00 0.00 


The price of the American put option is obtained by taking the average value of all dis- 
counted payoffs. We obtain an option price of 0.1406. If the option was European, its price 
would have been 0.1065. The Least-Squares Method allows us to price the early exercise 
flexibility feature associated with American options; the value of that flexibility is 0.1406 — 
0.1065 = 0.0341. 

This method can be used in many other option pricing cases with early exercise possibility. In 
addition, it is possible to use many basis functions, which improve the precision. For example, 
Longstaff and Schwartz (2001) suggest using the Laguerre, Hermite, Legendre, Chebychev 
polynomials among others. 


MATLAB Program 


The MATLAB program to implement the algorithm of the Least-Squares Method is given 
below. This program includes a function to generate the paths and another function to perform 
the regression and price the option in a backward manner (one step at a time). 


function LSM 
$Function to calculate the price of an American option 
Susing the Least-Squares method. 


T=1; Maturity 

TimePresent=T; %Time to maturity 
NbStep=50; Number of steps. 

K=40; %Exercise price 

sigma=0.2; SVolatility of the asset 
NbTraj=50000; SNumber of paths 
DeltaT=T/NbStep; 

SqDeltaT=sqrt (DeltaT) ; 

r=0.06; 

SBegin=36; %Initial price of the asset 


šTo increase the precision of the regression, 
swe use an initial price of 1 

sand we decrease the exercise price consequently 
S0=1; % 

Strike=K/SBegin; 
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We generate the paths of the asset price 
S=GeneratePaths (NbTraj,NbStep,DeltaT,SqDeltaT,r,sigma,S0) ; 


sPayoff is a vector formed of the largest cash flows 
S$using the optimal strategy 

Payoff=zeros (2*NbTraj,1); 
Payoff(:,1)=max(0,Strike-S(:,NbStep+1)); 


$Loop for backwardation by step from the maturity 
for cptStep=1:NbStep-1 

SDiscounting of the payoff vector 

Payoff=exp (-r«DeltaT) «Payoff; 


TimePresent=TimePresent -DeltaT; 


$Calculation of the new payoff vector by deciding if it is 
optimal 
Sto exercise the option immediately. 
Payoff=BackwardStep (Payoff,Strike,TimePresent,NbTraj,r,DeltaT,... 
S(:,NbStep+1l-cptStep) ) ; 
end 


$Calculation of the option price 
Price=mean (exp (-r«*DeltaT) Payoff); 


$Calculation of the option price with the initial price of the 
asset 
disp (PricexSBegin) ; 


end 


function S=GeneratePaths (NbTraj,NbStep,DeltaT,SqDeltaT,r,sigma,S0) ; 
$Function to simulate the paths of the asset price 


dW=SqDeltaTsxrandn (NbTraj,NbStep) ; 
dW=cat (1,dW, -dW) ; 


Increments= (r- (sigma*2) /2) sDeltaT+sigmaxdWw; 
LogPaths=cumsum([log(S0) *ones(2s*NbTraj,1),Increments] ,2) ; 
S=exp (LogPaths) ; 

end 


function Payoff=BackwardStep(Payoff,Strike, TimePresent, 
NbTraj,r, ... DeltaT,S) 

$Function using the Least-Squares method to determine 

$if it is preferable to exercise immediately the option. 

SPayoff: Old vector of Payoff at time t+1 

$Strike: Exercise price of the option 

$TimePresent: time t 
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$NbTraj: Number of paths 

Sr: interest rate 

sDeltaT 

$S: Vector of asset prices at time t 


Vector containing the paths where the price of the 

asset is lower than 

šthe exercise price (otherwise it is preferable to exercise) 
SelectedPaths=(StrikesS) ; 


SMatrix of regressors 
X= [ones (2%NbTraj,1).*SelectedPaths, (S.*SelectedPaths),... 
(S.*SelectedPaths) .*2]; 


sVector of expected payoffs 
Y=Payoff.*«SelectedPaths; 


Regression to determine the coefficients 
A=inv (X!/ «X) «XX! xY; 


$Calculation of the payoffs values when the option is not 
exercised 
Continuation= (X*A) .*SelectedPaths; 


$Values when the option is exercised immediately 
Exercise=max(0,Strike-S) ; 


Paths, exercise decision and update of the Payoff vector. 
for i=1: (2+NbTraj) 
if ((Exercise(i,1)>0)&(Exercise(i,1)>Continuation(i,1))) 
Payoff (i,1)=Exercise(i,1); 
end 
end 


end 


We use this program to calculate the value of an American put option. We use 100 000 paths 
including 50 000 antithetic, 50 time steps per year (hence 50 possible exercise dates per year), 
a risk-free rate of 6% as well as an exercise price of 40$ and an initial price S(O) in {38, 40, 
42}. In Table 10.21, we show the results and compare them to those of the finite difference 
method presented in Longstaff and Schwartz (2001). 


Least-Squares Method with Multiple Underlying Assets 


The previous section introduced the Least-Squares Method for the case of one dimension. 
With only one underlying asset, it is relatively easy to understand the approach. However, this 
method is also useful for cases featuring several underlying assets and complex cash flows. We 
use the Least-Squares Method to price an American call option written on three underlying 
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Table 10.21 Prices of an American put option with the Least-Squares Method 


S(O) o T Finite difference Least-Squares 


years 
38 0.2 1 3.250 3.244 
38 0.2 2 3.745 3.725 
38 0.4 1 6.148 6.129 
38 0.4 2 7.670 7.675 
40 0.2 1 2.314 2.310 
40 0.2 2 2.885 2.878 
40 0.4 1 5.312 5.302 
40 0.4 2 6.920 6.895 
42 0.2 1 1.617 1.618 
42 0.2 2 2.212 2.211 
42 0.4 1 4.582 4.574 
42 0.4 2 6.248 6.239 


assets. Let’s consider the following processes for the underlying assets prices 


dSı(t) ~ 
= 0.05dt + 0.2dWi(t), (10.23) 
Si(t) 
dSo(t X 
2) _ 905d + 0.3dW(t), (10.24) 
So(t) 
dS3(t Š 
3) _ 9 05d + 0.2dW3(), (10.25) 
S3(t) 


with initial values $;(0) = 40. The payoff function for the American call option at time t is 
given by 


max(0, Sı(t) — 40, S2(t) — 40, $3(t) — 40). (10.26) 


Now we need to choose the basis functions to perform the regressions. There exists an 
infinite number of functions we could have used. Here we choose basis functions leading to 
polynomials of order less than or equal to 4. 


1 Sı S2 S3 (Si) 
(S2)? (S3)? Sy So S183 S253 
(S\)° (Hy (SY (S1)?S2 (S1? S3 
(S2) S1 (S2 S3 (SFS (SFS (S) (10.27) 


(S2)* (S3)* (SPS (SS (SPS 
(S2) S3 (SSi (SFS (SDS? (SS 
(S2)°(S3)? 


The MATLAB program used to obtain the results shown in Table 10.22 is given below. 


function Least2 

$Function to calculate the price of an American call option 
Susing the Least-Squares method. 

SHere the underlying asset is a basket of assets. 
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Table 10.22 Prices of an American call option on three underlying assets with the 
Least-Squares Method 


Exercise price ($) T (year) Option price ($) 


38 0.25 7.1555 
0.50 10.4924 
40 0.25 5.9368 
0.50 8.7276 
42 0.25 4.3501 
0.50 7.0876 


%SBegin: Vector of initial prices 
SBegin=40; 

$T: Maturity of the option 
T=0.25; 

SK: Exercise price of the option 
K=387 


S$Initial parameters 
TimePresent=T; 
NbStep=12; 
NbTraj=100000; 
DeltaT=T/NbStep; 
SqDeltaT=sqrt (DeltaT) ; 
r=0.05; 

S021; 

Strike=K/SBegin; 


SVectors to store the paths of then three underlying assets 
S1=GeneratePaths (NbTraj ,NbStep,DeltaT,SqDeltaT,r,0.2,S0) ; 
S2=GeneratePaths (NbTraj ,NbStep,DeltaT,SqDeltaT,r,0.3,S0) ; 
S3=GeneratePaths (NbTraj ,NbStep,DeltaT,SqDeltaT,r,0.4,S0) ; 


$Payoff is a vector formed by the highest cash flows 
Susing the optimal strategy 
Payoff=zeros (24«NbTraj,1) ; 


Payoff(:,1)=max([S1(:,NbStep+1) -Strike,S2(:,NbStep+1)-Strike,... 
S3(:,NbStep+1) -Strike, zeros (2«NbTraj,1)]')'; 


sLoop for the backwardation by steps starting from the maturity 


for cptStep=1:NbStep-1 
sDiscounting of the vector Payoff 
Payoff=exp (-r«DeltaT) «Payoff; 


TimePresent=TimePresent-DeltaT; 


SCalculation of the new vector Payoff with respect to the 
exercise 


o 


% decision. 
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Payoff=BackwardStep (Payoff, Strike,TimePresent,NbTraj,r,DeltaT,... 


S1(:,NbStep+1-cptStep) ,S2(:,NbStep+1- 
cptStep),... 
S3(:,NbStep+1l-cptStep) ) ; 

end 


sCalculation of the price and the standard deviation 
std(exp(-rs«DeltaT) *«Payoff) ; 
Price=mean (exp (-rxDeltaT) *«Payoff) ; 


$Calculation of the option price with the departure 
value of the asset 
disp (PricexSBegin) ; 


end 


function S=GenerePaths (NbTraj,NbStep,DeltaT,SqDeltaT,r,sigma,S0O) ; 
$Function to simulate the paths of the asset price 


dW=SqDeltaTsrandn (NbTraj,NbStep) ; 
dW=cat (1,dW, -dWw) ; 


Increments= (r- (sigma*2) /2) sDeltaT+sigmaxdWw; 
LogPaths=cumsum([log(S0) *ones(2sNbTraj,1),Increments] ,2) ; 
S=exp (LogPaths) ; 

end 


function Payoff=BackwardStep(Payoff,Strike, TimePresent, NbTraj,... 
r,DeltaT,S1,S2,S3) 

$Function using the Least-Squares method to determine 

$if it is preferable to exercise immediately. 

SPayoff: Old vector Payoff at time t+1 

$Strike: Exercise price of the option 

$TimePresent: Time t 

SNbTraj: Number of paths 

sr: Interest rate 

sDeltaT 

%S1 S2 S3: Vectors of assets prices at time t 


sVector containing the paths where the asset price is less than 


o 


$the exercise price (otherwise it is not preferable to exercise 
% the option) 

SelectedPaths= (max ([S1-Strike,S2-Strike,S3-Strike]’)’>0); 
S1Temp=S1.*SelectedPaths; 

S2Temp=S2.*SelectedPaths; 


S3Temp=S3.*SelectedPaths; 


$Matrix of regressors 


X= [ones (2%NbTraj,1).*SelectedPaths,S1Temp,S1Temp.*2,S1Temp.%73,... 


S2Temp, S2Temp.*2,S2Temp.%3,S3Temp, S3Temp.*2,S3Temp.%*3,... 
S1lTemp.*S2Temp,S1Temp.*S3Temp, S2Temp.*S3Temp,... 
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SiTemp.*2.+*S2Temp,S1Temp.*2.%*S3Temp,S2Temp.*~2.*S1Temp,... 
S2Temp.*2.+*S3Temp,S3Temp.*2.*S1Temp,S3Temp.*2.*S2Temp,... 
S1Temp.*4,S2Temp.*4,S3Temp.*4,S1Temp.*3.*S2Temp,... 
S1Temp.*3.+*S3Temp,S2Temp.*3.*S1Temp,S2Temp.*3.*S3Temp,... 
S3Temp.*3.+*S1Temp, S3Temp.*3.*S2Temp,S1Temp.*2.*S2Temp.*2,... 
SiTemp.*2.+*S3Temp.*2,S2Temp.*2.%*S3Temp.%2] ; 


sVector of expected values 
Y=Payoff.+*#SelectedPaths; 


%$Regression to find the coefficients 
A=inv (X' «X) «X' +Y; 


$Calculation of the expected values when the option is not 
exercised 
Continuation=(X+*A) .*SelectedPaths; 


S$Values when the option is exercised immediately 
Exercise=max ( [zeros (2*NbTraj,1),S1-Strike,S2-Strike, 
S3-Strike]’)’; 


Paths, exercise decision and updating of the vector Payoff. 
for i=1: (2«NbTraj) 
if ((Exercise(i,1)>0)&(Exercise(i,1)>Continuation(i,1))) 
Payoff (i,1)=Exercise(i,1); 
end 
end 


end 


10.2.2 Simulations Using The Dynamic Programming Technique of Barraquand 
and Martineau (1995) 


The Dynamic Programming Technique has been developed to price American options written 
on n underlying assets 


S = (S1, S2,..., Sn). (10.28) 


The fundamental idea of this method is to divide the approximation space (dimension n) into 
cells. Next, we assume that the payoff function and the optimal strategy are constant over each 
cell. Finally, we need to evaluate the price of the option on this partitioned space. The steps of 
the algorithm to be implemented are presented below. 

Since the option is written on n underlying assets, the payoff function is given by: 


P:R” xR —R 
(S,t) — P(S, +). (10.29) 
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We divide the time to maturity of the option according to {0, At,..., T }. If we have a partition 
Q of R, then we can construct a partition of R” x t given as follows 


Partition;(t) = {S € R"|P(S, t) € QO}. (10.30) 
We assume the partition Q to be composed of k sets. The intervals Q; (t) will be given by 
DO =JA MFO, A] (10.31) 
for i € [2, k — 1] and by 
Q(t) = ] — œ, A®)], (10.32) 
Q(t) = JAG) OE, +00]. (10.33) 


We will then have the following cells: 
Partition; (t) = {S € R'A eP O < P(S, t) < Ate? OY} (10.34) 
fori € [2, k — 1] and 


Partition, (t) = {S € R"|P(S, t) < A(t)} (10.35) 
Partition,(t) = {S € R"|P(S,t) > Ae 4}. (10.36) 

We choose A(t) and B(t) such that 
Prob(S(t) € Partition;(t)) © Prob(S(t) € Partition,(t)) ~ 0.1%. (10.37) 


The choice of 0.1% is somewhat arbitrary; it will depend on the dispersion of our paths and 
the problem under study. Let’s denote by S!(t),... , S“(¢) the M generated paths of the process 
S(t). We then define 


aj;(t) = Card{m e€ [1, M]|S” (t) € Partition; (t)}, (10.38) 
Bi, i(t) = Card{m € [1, M]|S” (t) € Partition; (t), 

S” (t + At) € Partition; (t + At)}, (10.39) 

no= X PS"@),0, (10.40) 


S™ €Partition; (t) 


where Card stands for the cardinal or the number of elements in the set. These numbers will 
be useful because 


i(t 
aN (10.41) 
a; (1) 
is the average value of payoffs on cell Partition,(¢) and 
Bi, ;(t) 
ats 10.42 
a; (t) 


is an approximation of the conditional probability of moving to cell Partition;(¢ + At) when 
we are already in cell Partition;(t). We can therefore use the following algorithm to calculate 
the price of the American option. 


1. At time T, we calculate 


vi(T) 
a;(T) 


Price(i, T) = (10.43) 
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2. At time T — At, for each i € [1, k] we have 


k 
(T — At . i;(T — At 
Price(i, T — At) = max I =A: X Price(j, ryf = AD (10.44) 
aiT — At) “> aj(T — At) 
3. We apply the previous step recursively to calculate Price(i, T — 2At),... , Price(1, 0). 


4. Since we buy the option at time — Aż so that the exercise decision period begins at time 
t = 0, we can obtain the final price by calculating 


e '“'Price(1, 0). (10.45) 


MATLAB Program 


We propose a MATLAB program that was used to implement the algorithm presented above 
on the Dynamic Programming approach. The method has been applied to calculate the price of 
an American option written on three underlying assets. The payoff of the equivalent European 
option is given by: 


Payoff = max(0, Sı(T) — K, S(T) — K, S3(T) — K). (10.46) 


function SSA 

$Function to calculate the price of an American option 
Sunderwritten on 3 underlying assets and 

$using the dynamic programming technique of Barraquand and 
Martineau. 

$T: Maturity 

T=1/12; 

SStrike: Exercise price 

Strike=35; 


oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 


itial parameters 


JP oP ol? 

H 

B 
A oA o 
A oA o 
A A o 
A oA oA oA o 
A A o 
A oA o 
HP A o 
A oA o 
A oA o 
JP A oP 


oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 


k=100; Number of cells 

NbTraj=100000; Number of paths 

NbStep=10; sNumber of time steps 

n=3; sNumber of underlying assets 

mu=ones (n,1) *0.00; SDividend yield on the underlying assets 
r=0.05; SRisk-free rate 

DeltaT=T/NbStep; Time step 

sqDeltaT=sqrt (DeltaT) ; 


The vector partition keeps in memory the cell containing the asset 
The first column indicates the cell at time t and 

he second column indicates the cell at time t+1. 

artition is initialized as a vector of un, meaning that 

the 3 underlying assets start in cell P1(0). 

partition=[ones (NbTraj,1),zeros(NbTraj,1)]; 


A AO A oP 


‘tO cd 
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%The vector payoff stores the cash flows if the option is 


exercised. 


zeros (NbTraj,1); 


payoff 


b and gamma represent the parameters of the 


%The vectors a, 


we need to calculate them. 


S$Barraquand and Martineau technique, 


zeros (k,NbStep) ; 


a 
b 


zeros (k,k,NbStep) ; 


zeros (k,NbStep) ; 


gamma 


SMatrix of the standard deviations 
Svariance-covariance matrix 


% (independent assets) 


diag([0.2,0.3,0.5]); 


v= 


=v» (v'); 


kappa 


%x0 is the vector of initial values of the assets 


(40 is the initial value) 


2 
6 


40xones (NbTraj,n); 


x0= 


%xl is the vector of the assets values at time t and x2 at time t+1 


x1=x0; 


zeros (NbTraj,n); 


x2= 


oe 
oe 
oo 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 
oe 
oo 
oe 
oe 
oo 
oe 
oe 
oe 
pe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 


%End of the initialization 


ooo 
C08 
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oe 
oe 
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oe 
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oe 
oo 
oe 
oe 
o9 
oe 
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oe 
oo 
oe 
oe 
oe 
oe 


=0% 


b and gamma for the time t 


SCalculation of a, 
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the others are zero since there is only one cell 


= M, 


_1(0) 
aInit 


2 
6 


NbTrajj; 


is the only non null 


alone since a_1(0) 


$Calculation of gamma_i (0) 


element 
gammaInit 


exp (-r«DeltaT) «sum(Payoff (x1,NbTraj,Strike,DeltaT,r)); 


oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 
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b and gamma for t=0%% 


%End of the calculation of a, 
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SLoop for each time step 


o9 
oe 
oe 
oe 
oe 
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oo 
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oo 
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oe 
oe 
oe 
oe 
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oe 
oo 
oe 
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oo 
oe 
oe 
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=1:NbStep 


for cptStep 


SCalculation of x2 by simulating one time step 


NextStep (x1,kappa,v,sqDeltaT,DeltaT,mu,NbTraj,n,r); 


x2= 


$Calculation of the cash flows if the option is exercised 


immediately 


Payoff (x2,NbTraj,Strike,DeltaT,r) ; 


payoff 


$Determination of the partition and classification of the paths 


Swith respect to the cash flows 


partition ( 


2) =Partition(payoff,NbTraj,k) ; 


oT 
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SCalculation of the values of a, b and c to determine 
Sthe probabilities 
[a(:,cptStep) ,b(:,:,cptStep) ,gamma(:,cptStep)]=... 
Prob (partition,NbTraj,k,payoff,r,DeltaT) ; 


$The new partition and the x2 become the old 
$partition and the old x for the next time step 
partition(:,1)=partition(:,2); 

x1=x2; 


end 


$Initialization of the Calli to compute the price of the call 
Calll=zeros(k,1); 
Call2=zeros(k,1); 


$We initialize Call2 and avoid the division by zero 
Call2=gamma(:,NbStep) ./max(1,a(:,NbStep) ) ; 


$Loop starting at the maturity and proceeding backward 
by the time step 
for cptStep=1:NbStep-1 


sCalculation of Calll and no division by zero 
Calll=max(gamma(:,NbStep-cptStep) ,b(:,:,NbStep-cptStep+1) 
*Call2)./... 
max(1,a(:,NbStep-cptStep) ) ; 


%The new Calll becomes the old Call2 coming back by one step 
Call2=Calll; 


end 
$Calculation of the option price 


Price=max (gammaInit,b(:,:,1)*Call2)/aInit; 
callssap=Price(1) 


0) 
D 
Q 


End of the main program 
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function x2=NextStep(x1,kappa,v,sqDeltaT,DeltaT,mu,NbTraj,n,r) 
$x1: Values of the assets at time t 

$kappa: Variance-covariance matrix 

Matrix of the standard deviation 

spDeltaT: Square root of DeltaT 

DeltaT: Time step 


< 


oA oP ol? 
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Dividend yield 


Smu: 


Number of simulated paths 
Number of underlying assets 


Risk-free rate 


SNbTraj 


2 
6 


2 
or: 


Values of assets at time t+1 


SX2: 


S$Initialization of x2 


zeros (NbTraj,n) ; 


X2= 


SLoop to simulate the assets prices at time t+1. 


for cptTraj 


1:NbTraj 


),kappa,v,sqDeltaT,... 


DeltaT,mu,n,r)j; 


:)= SimUnderly (x1 (cptTraj, 


x2 (cptTraj, 


end 


end 
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%End of the simulation 
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S$Function to simulate the underlying assets% 


oe 
oo 
oe 
oe 
oo 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oe 
oo 
oe 
oe 


SimUnderly (x1,kappa,v,sqDeltaT,DeltaT,mu,n,r) 


function x2 


$Same parameters as the previous function but for the paths 


randn(n,1)j; 


at 


v«esqDeltaTsz; 


temp= 


$Initialization of x2 and simulation for each underlying asset 


zeros(1,n); 
for cptUnderly 


x2= 


1:n 


x1(1,cptUnderly) «exp ((r-mu(cptUnderly, 1) - 


x2 (1,cptUnderly) 


kappa (cptUnderly, cptUnderly) /2)*DeltaT + temp 


(cptUnderly,1)); 


end 


end 
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SEnd of the simulation of the underlying assets% 
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SFunction to calculate the payoffs% 
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Payoff (x2,NbTraj,strike,DeltaT,r) 


function payoff 


Values of the assets 


SX2: 
SNbTraj 


Number of paths 
Exercise price 


sStrike 
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sDeltaT 
$r: Risk-free rate 


payoff=zeros (NbTraj,1); 


%For each path, we calculate the payoff if the option is 


exercised 
for cptTraj=1:NbTraj 


payoff (cptTraj,1)=exp(-r«*DeltaT) «max (0,max (x2 (cptTraj,:))- 


strike); 
end 


End of the payoffs calcula 
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function partition=Partition(payoff,NbTraj,k) 

$Function determining the partitioning of the cells and 
$classify the path in the partition it belongs 

spayoff: Vector of cash flows if the option is exercised 


immediately 
$NbTraj: Number of paths 
$k: Number of cells 


partition=zeros (NbTraj,1); 


SOrdering the payoffs to obtain the distribution 


temp=sort (payoff) ; 


%Keep only the non null elements 
tempNonZero=nonzeros (temp) ; 


$Calculation of the factor A to class the payoffs 


o 
$ 
o 

5 


(take the 0.1 percentile) 
The function max is used to avoid a zero 


A=tempNonZero (max (1,floor(0.001lxsize(tempNonzZero,1)))); 


SInscription of the number 1 for the paths in the first cell 


partition=partition+1.s (payoff<=A) ; 


o 
5 
o 

$ 


Determination of the factor B to class the payoffs 
(take the 99.9 percentile) 


B=log (temp (floor (0.999xNbTraj)-1)/A)/(k-2); 


S$Inscription of the number k for the paths in the last cell 
partition=partition+k+ (payoff>Axexp (B+ (k-2))); 
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SInscription of the appropriate number for the other paths 
sby checking all the possible cells 


for cptPartition 


2:k-1 
partition + 


partition 


.* 


cptPartitions (payoff>A+xexp (B+ (cptPartition-2))) 


Axexp (B+ (cptPartition-1))); 


(payoff< 


end 


end 
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$End of the classification of the paths% 
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$Function to calculate the probabilities a,b,gamma% 
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function [a,b,gamma] =Prob(partition,NbTraj,k,payoff,r,DeltaT) 


Table indicating in which cell is located 


the paths at time t and t+1 
Number of paths 


Number of cells 


Spartition: 


oo 


SNbTraj 
Sk 


Vector the cash flows if the option is exercised 


Risk-free rate 


sDeltaT 


Spayofft 


2 
Ko Sard 


The numbers used to calculate the probabilities 


b and gamma: 


E 
ša, 


2 
© 


to move from one cell to another 


b and gamma 


SInitialization of a, 


zeros (k,1); 


a 
b 


zeros (k,k); 


zeros (k,1); 


gamma 


SDiscounting factor computed once 


exp (-r«*DeltaT) ; 


act= 


%All the paths and incrementing the counters 


b and gamma highlighted. 


° 
ga, 


1:NbTraj 


for cptTraj 


a(partition(cptTraj,2),1)=a(partition(cptTraj,2),1)+1; 


b(partition(cptTraj,1),partition(cptTraj,2))=... 


b(partition(cptTraj,1),partition(cptTraj,2))+act; 


gamma (partition (cptTraj,2),1) 


gamma (partition (cptTraj,2),1) 


actxpayoff (cptTraj,1); 


end 


end 
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$End of the calculation of the probabilities % 
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Table 10.23 Prices of an American call option using the Dynamic Programming Technique 


T (year) K ($) Callppg ($) Callssap ($) 
35 8.59 8.6051 
1/12 40 3.84 3.7837 
45 0.89 0.9145 
35 12.55 12.5614 
4/12 40 7.87 7.8971 
45 4.26 4.3071 
35 15.29 15.2957 
7/12 40 10.72 10.6878 


45 6.96 7.0526 


We compare the computed results (Callssap) with those obtained with another pricing 
method (Callppg) given in Barraquand and Martineau (1995). The results are presented in 
Table 10.23. 

The Barraquand and Martineau method is efficient and applicable to various problems. 


10.3 ASIAN OPTIONS 


This example follows the work of Kemna and Vorst (1990). We assume the price of the 
underlying asset to follow a geometric Brownian motion: 
S(t + At) = S(t) +rSOAt + oS(NAW(t). (10.47) 


The goal is to calculate the price of Asian options on arithmetic and geometric means. 


10.3.1 Asian Options on Arithmetic Mean 


Consider the following value for the option on the arithmetic mean: 


I 
E le max (; XC SGT/I) — K, °) (10.48) 


i=l 
For example, when J = 3, we have 
i S(T /3) + SQT/3)+ S(T 
Efe? max ( (T/3) + /3) + ST) x.0)], 

Compute this value using the parameter values: S(0) = $ 100, K = $ 100, ø = 0.30, r = 0.03, 
T = 1 year with 250 time steps. We use N = 50000 simulations and J = 1, 2, 5, 10, 25, 50, 
125. 

Using the previously discussed methods, we save the simulated values. Next, we calculate 
the mean and the payoff of the option as follows: 


(10.49) 


function Rep=ArithOptCall (I) 

$Function to calculate the price of the Asian option 
$on arithmetic average. 

$I: Number of days used to calculate the average 
SRep: Price of the option 

T=3%. 
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$Initial parameters 


S0=100; 
r=0.03; 
sigma=0.3; 
Strike=100; 
Taly 


NbStep=250; 
DeltaT=T/NbStep; 
NbTraj=50000; 


sNumber of days used to calculate the average 
NbDaysAverage=I; 


sNumber of days between two days to calculate the average 
DaysBetween = NbStep/NbDaysAverage; 


S$Vector used to keep the values needed to calculate the average 
A=zeros (NbTraj,1); 


sWe compute each path 

for cptTraj=1:NbTraj 
Path=GenerePaths(S0,r,sigma,1,NbStep,DeltaT) ; 
SFor each day used to calculate the average, we keep the 
Sprice of the asset 
for cptStep = DaysBetween:DaysBetween:NbStep 

A(cptTraj,1)=A(cptTraj,1) + Path(1,cptStep) ; 

end 


$Calculation of the arithmetic average 
A(cptTraj,1)=A(cptTraj,1) /NbDaysAverage; 
end 


$Calculation of the cash flows, the average and the standard 
deviation 
Payoff=exp (-r«T) «max (A-Strike, 0) ; 
std (Payoff) /sqrt (NbTraj) 

Rep=mean (Payoff,1) ; 


end 


function Rep=GenerePaths(S0,r,sigma,NbTraj,NbStep, DeltaT) 
$Function to generate the paths. 

$S0: Initial price of the asset 

Sr: Risk-free rate 

$sigma: Volatility 

$NbTraj: Number of simulated paths 

SNbStep: Number of time steps per paths 

SDeltaT: Delta T 


NuT = (r - sigmaxsigma/2) *DeltaT; 
SqDelta = sqrt (DeltaT) ; 

DeltaW = SqDeltasrandn(NbTraj, NbStep) ; 
Increments = NuT + sigmasxDeltaW; 
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LogPaths = cumsum(cat (2,log(S0)*ones(NbTraj,1) , Increments) , 2); 
Rep = exp(LogPaths) ; 


end 


Table 10.24 presents the simulations results. 


10.3.2 Asian Options on Geometric Mean 


Consider the following value for the option on the geometric mean: 


Ele" max (( Il sarın)” =K, 0). (10.50) 
i=1 


For example, when J = 3, we have 
Efe"? max((S(T /3)S(2T /3)S(T))'? — K, 0)]. (10.51) 


Calculate the value of the option using as parameter values: S(0) = $ 100, K = $ 100, o = 
0.30, r = 0.03, T = 1 year and 250 time steps. We run N = 50000 simulations and 7 = 1, 2, 
5, 10, 25, 50, 125. 

In this case, we use the same program as for the arithmetic mean above except that we need 
to insert a loop to calculate the geometric mean. 


function Rep=GeomOptCall 

$Function to calculate the price of an Asian option on 
geometric average. 

$I: Number of days used to calculate the average 

L=57 
SRep: Price of the option 


$Initial parameters 


S0=100; 
r=0.03; 
sigma=0.3; 
Strike=100; 
Tsi; 


NbStep=250; 
DeltaT=T/NbStep; 
NbTraj=50000; 


sNumber of days used in the average calculation 
NbDaysAverage=I; 


sNumber of days between two dates to calculate the average 
DaysBetween = NbStep/NbDaysAverage; 


SVector used to keep the values needed to calculate the average 
G=ones (NbTraj,1) ; 


sWe calculate each path 
for cptTraj=1:NbTraj 
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Path=GeneratePaths (S0O,r,sigma,1,NbStep,DeltatT) ; 


%ļFor each day used to calculate the average, we keep the 

asset price. 

for cptStep = DaysBetween:DaysBetween:NbStep 
G(cptTraj,1)=G(cptTraj,1)*Path(1,cptStep) ; 

end 


SCalculation of the geometric average 
G(ceptTraj,1)=G(cptTraj,1)*(1/NbDaysAverage) ; 
end 


SCalculation of the cash flows, the average and the standard 
deviation 

Payoff=exp (-r«T) «max (G-Strike, 0) ; 

std (Payoff) /sqrt (NbTraj) 

Rep=mean (Payoff,1) ; 


end 


function Rep=GeneratePaths(S0,r,sigma,NbTraj,NbStep,DeltaT) 
$Function generating the paths. 

%S0: Initial price of the asset 

Sr: Risk-free rate 

Ssigma: Volatility 

SNbTraj: Number of simulated paths 

SNbStep: Number of time steps per path 

SDeltaT: Delta T 


NuT = (r - sigmaxsigma/2) *DeltaT; 

SqDelta = sqrt (DeltaT) ; 

DeltaW = SqDeltasxrandn(NbTraj, NbStep) ; 

Increments = NuT + sigma+DeltaW; 

LogPaths = cumsum(cat (2,log(S0)sones(NbTraj,1) , Increments) , 2); 
Rep = exp(LogPaths) ; 


end 


Table 10.24 Prices of Asian call options 


Arithmetic ($) Geometric ($) 

I Estimation Error Estimation Error 
1 13.1329 0.0964 13.1329 0.0964 

2 10.3149 0.0741 9.9840 0.0724 

5 8.5887 0.0602 8.1774 0.0578 
10 8.0379 0.0569 7.6164 0.0543 
25 7.7484 0.0542 7.3241 0.0516 
50 7.5742 0.0532 7.1500 0.0505 


125 7.4971 0.0529 7.0780 0.0501 
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The results are also summarized in Table 10.24 where the errors are obtained by dividing 
the standard deviation by the square root of the number of simulations. 
We note that the geometric mean is lower than the arithmetic mean, which was expected. 


10.4 BARRIER OPTIONS 


This example stems from El Babsiri and Noel (1998). We assume that the price of the asset 
follows a geometric Brownian process: 


S(t + At) = S(t) +rS(HAt + oS(Q*)AW (0). (10.52) 


The goal is to calculate the prices of barrier options. There are several types of barrier options. 
Here, we focus on knock-in options: Up-And-In and Down-And-In call options. The payoff of 
a Down-And-In call option is the following 


max(S(T)— K,0) if minejo,r) S(t) < H (10.53) 
O if minor) S(t) > H í 


and for an Up-And-In call option 


max(S(T)— K,0) if max;ejo,r) S(t) > H (10.54) 

0 if Max;¢[0,7] S(t) < H’ ` 
where H is the barrier and K the exercise price. To calculate the payoff of these options, 
we need to know S(T) and the minimum or the maximum of S(t). It is possible to generate 
In(S(T)/S(0)) since we know that it has a normal distribution N (rT, 0 JT ). From El Babsiri 
and Noel (1998), one can show that 


Prob ( min In (a) <y | In (Sa) = x) = gor: (10.55) 
te(0,T] S(0) S(O) 


where y < x and y < 0. If we consider the conditional distribution function Fy of the minimum 


y of In), then we know that Fy(y) follows a uniform distribution over the interval [0, 1]. 


Hence, denoting u = Fy(y) and inverting this equation, we obtain 


x — f/x? — 262T In(u) 
y= 5 . (10.56) 


Note that y is non-positive since u is drawn from a uniform distribution between 
Oand 1. 

To perform the simulation of a Down-And-In barrier option, we simulate In(S(T)/S(0)) and 
a uniform random variable u. Then we find the minimum y and obtain the payoff of the option. 
Here is the MATLAB program for the method. 


function DownAndInCall 
S$Function to calculate the value of an exotic call option Down 
And In. 
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SO0O=100; 
Strike = 100; 
Barrier = 90; 


InterestRate = 0.05; 
Dividend = 0.02; 

i ie — ae ba 

sigma = 0.30; 

NbTraj = 1000000; 

Mu = InterestRate-Dividend; 


$Simulation of the final value of the paths. 
Paths = exp(log(S0) +(Mu-sigmassigma/2) *T+ 
randn (NbTraj,1)*sigmaxsqrt (T)); 


Calculation of the value x in the article of El Babsiri and Noel. 
x=log(Paths/S0) ; 


$Simulation of the minimum of the paths 
Uniform = rand(NbTraj,1); 
Minimum= (x-sqrt (x.*#x-2+*sigma*2«*Txlog (Uniform) )) /2; 


SCalculation of the payoffs at the maturity 
PayoffPV=exp (-InterestRatexT) max (Paths-Strike,0) .« 
(Minimum<log(Barrier/S0) ); 


SPresentation of the mean and the standard deviation of the 
payoffs. 

disp(’Estimation’ ) ; 

disp (mean (PayoffPV,1)); 

disp(’Error’); 

disp (std (PayoffPV) /sqrt (NbTraj)); 

end 


For an Up-And-In option, we have 


t (xy 
Prob ( max In (sR) <y | In x) = x) =1- ear (10.57) 
te[0,7] S(O) S(0) 


and also 


x + yx? — 202T In(1 — u) 
> : 


(10.58) 


In this case, y > x and y > 0. The program is almost the same as in the Down-And-In option 
case. Just like El Babsiri and Noel, we use the following parameter values S(0) = $ 100, K = 
$ 100, r = 0.05, dividend = 0.02, NbTraj = 1000000, and T = 1. Table 10.25 presents the 
results obtained from the program. 
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Table 10.25 Simulated prices of the barrier options 


Down-And-In (H = 90) ($) Up-And-In (H = 110) ($) 

o Estimation Error Estimation Error 
0.10 0.0725 0.0007 4.7942 0.0073 
0.15 0.6081 0.0028 7.0977 0.0105 
0.20 1.6384 0.0055 9.1173 0.0139 
0.25 2.9936 0.0086 11.1010 0.0176 


0.30 4.4904 0.0119 12.9615 0.0215 


10.5 ESTIMATION METHODS FOR THE SENSITIVITY 
COEFFICIENTS OR GREEKS 


In this section, we present some methods to estimate the option sensitivity coefficients. Several 
procedures can be used. We discuss the pathwise derivative estimates method, the likelihood 
ratio method and the retrieval of volatility method. 

A simple method consists of calculating the differential numerically over discrete intervals. 
Let’s consider a security or a derivative product, P, depending on the parameter X. We have 


_ P(X + AX) — P(X) 
= lim 3 
AX 30 AX 


(10.59) 


There are some problems with this method. First, to obtain more precise results, we need to 
use small increments AX. However, a very small AX will lead to approximation errors and 


the results will be erroneous. Second, we could use a larger AX and simulate several paths to 
calculate the coefficient and then average the obtained results. It has been shown that with this 
method the estimator is biased. We can run as many simulations as possible, and we will always 
experience bias. That’s why other techniques have been developed to mitigate these problems. 


10.5.1 Pathwise Derivative Estimates 


The estimation technique for sensitivity coefficients has been introduced, for instance by 
Broadie and Glasserman in 1996. It is based on the structure of the derivative of the payoff 
function. Let P be this payoff function (discounted payoff). We know that the price P of an 
option is given by 


P=E[P]. (10.60) 


Assuming the derivative of an expectation is equivalent to the expectation of the derivative, 
we can calculate 


dP  dE[P] dP 
— = x — |. (10.61) 
dX dX dX 

For simplicity, we assume the necessary and sufficient conditions to be satisfied for this 
equality to hold. Therefore, we will only posit that if the payoff function P is continuous with 
respect to the path X(t) then this equality holds. 
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From this equation, we see that we can evaluate the derivative dP/dX using an expectation. 
We start by calculating the derivative d P /d X , then simulate the underlying and finally calculate 
the expectation. Example 5.1 illustrates the method. 


Example 5.1 We use the pathwise derivative estimates method to calculate the sensitivity 
coefficient DELTA of a call option. In this case, we have 


P =e "' max(0, S(T) — K). (10.62) 
Hence, we calculate 
| r S(T) 
a peeps ey — 10.63 
ISO) SDA S0) E 


and we can estimate DELTA (A) using Monte Carlo simulation techniques by simulating 
the underlying asset S (and possibly the interest rate too) and averaging the obtained results. 


We present some estimators for a relatively simple payoff function. In the case of an Asian 
call option for which the average of the prices is observed over the last m days 


m 


= 22 St) = D S(T —iAt), (10.64) 
i=l 
P= or ae §-K) (10.65) 
and assuming the underlying asset price to be log-normally distributed: 
SSO et eee, (10.66) 


then we obtain the following estimators: 


5 
eo a ge a I E EN 
dP T 1 
Y= — liš>K} — n si )dog (S(t;)/S(0)) — (r — ê + o 2/2)ti), (10.68) 
do ‘mo = 
d’P T(K 
na S(t;)t; —T S—K,0 10.70 
p= = (5000-1 max(S — K, 0)] , (10.70) 
where 
wltm) = MK — S) + S(tm), (10.71) 
n(d(u, v, t)) 
Sog —————— 10.72 
g(u, v,t) (v>0} ieee (10.72) 
Bae Ae et 
ee log(v/u) — (r — ê — o LE (10.73) 


ost 
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where n(.) is the probability density function of the normal distribution, u and v are dummy 
variables, and ô is the dividend yield. 

The following program is used to calculate the sensitivity coefficients DELTA and GAMMA 
of an Asian option (arithmetic mean) with the pathwise derivative estimates technique. 


function rep = AsiPath 


CeO. SRisk-free rate 

k=100; Strike price 
sigma=0.25; SStock return volatility 
div=0.03; $Dividend yield = delta 
T=0.2; sMaturity (years) 

S0=90; Stock initial value 


$DELTA and GAMMA estimations for an Asian arithmetic option 
$with pathwise derivatives estimates 


NStep = floor(T*365.25); Number of time step 

DeltaT = 1/365.25; $Length of a step 

NTraj = 100000; sNumber of paths 

m = 30; Number of days used to calculate the arithmetic average 
Si = zeros(2«NTraj,m);% Storage of the values 


o 


SAve = zeros(2s«NTraj,1);% Average values 


%Parameters initialization 

dW = zeros (2+NTraj,1); 

S = SOxones(2*NTraj,1); 

dW=sqrt (DeltaT) *«randn(NTraj,NStep) ; 
dW=cat (1,dW, -dW) ; 

End of parameters initialization 


$Storing the simulation values 
j=1; 
for i=1:NStep 
S = S + S.*((r-div)«*DeltaT + sigmasxdW(:,1i)); 
if i > NStep - m 
Si(:,j) = S 
j=j+1; 
end 
end 
šEnd of Storing the simulation values 


%Computing the average value 
SAve = sum(Si,2)/m; 


%šEstimation of DELTA 
EstimDel = exp(-r«T) * (SAve>=k) .*SAve/S0; 


Del = sum(EstimDel) /(2«NTraj) ; 
DelError = sqrt((EstimDel -Del)’«s(EstimDel - 
Del) / (2«NTraj)) /sqrt (NTraj) ; 
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$Printing the result for DELTA 

fprintf ('\n Option’’s DELTA: %f£', Del); 
fprintf Ni Standard error: %f’', DelError); 
SEnd of Estimation of DELTA 


o 


Estimation of GAMMA 
= zeros (2xNTraj,1); 
zeros (2*NTraj,1); 
= mą(k - SAve) + Si(:,m); 
for i=1:2sNTraj 

if (w(i,1)>0) 

d(i,1) =(log(w(i,1)/Si(i,m-1) ) - (r-div-0.5*sigma*2) 
*xDeltaT)/... 
(sigmasxsqrt (DeltaT) ) ; 

g(i,1) = normpdf(d(i,1))/(w(i,1)*sigmaxsqrt (DeltaT) ) ; 


ZQ Q 
Il 


end 
EstimGam = exp(-r«T) *(k/SO) *2*mxg; 


Gam = sum(EstimGam) / (2«NTraj) ; 
GamError = sqrt((EstimGam -Gam)’s(EstimGam -Gam) / (2*NTraj) ) / 
sqrt (NTraj) ; 


% Printing the result for GAMMA 

fprintf ('\n Option’’s GAMMA: %f£’, Gam); 
fprintf ('\n Standard error: %f£'’, GamError); 
%End of Estimation of GAMMA 


rep == ily 


We run this MATLAB program by changing the initial stock price and keeping the other 


parameter values constant. The simulations’ results are given in Table 10.26. 


10.5.2 Likelihood Ratio Method 


This estimation method of the sensitivity coefficients is based on the structure of the probability 
density function. We know that the price of an option is given by the expectation of its future 


cash flows P. We therefore have the following formula: 


[o6] 
P =E[P] =. P(s)f (s)ds, (10.74) 
0 
Table 10.26 Estimation of DELTA and GAMMA of an Asian option 
Greeks S(O) = $ 90 Error ($) S(O) = $ 100 Error ($) S(0) = $ 110 Error ($) 
A 0.174 0.0013 0.561 0.0017 0.867 0.0012 
T 0.029 0.0015 0.040 0.0016 0.020 0.0010 
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where f(s) is the probability density function of the variable S(T). Assuming that the function 
P does not depend on the parameter X, we have 


dP — d fy” P(s)f(s)ds 


dX dX 
f Bit TOs 
0 
A frosty, 
0 dX f(s) 


d lo 
fe Bos) CET Fleas 


d log da | l 
S(T) 


=E [Ps0 (10.75) 


Note that we have interchanged the differential with the integral, but we will not discuss further 
the complex details associated with this operation. We only assert that it can be done when 
the probability density function f is smooth enough. In general, if the function is continuous, 
we can calculate the likelihood ratio estimator using this approach. Let’s now illustrate the 
approach with an application case on European call options. From 


S(T) = S(0)e®70°/DT+ovTZ (10.76) 


we deduce that S(T) is log-normally distributed and that 


fœ) = r OO (10.77) 


where 


n(z) = l n (10.78) 

Var 

In(x /S(0)) — (r — o? /2)T 
oVT ` 


Assume that we use the technique for a European call option. The payoff function is P= 
e~’T max(S(T) — K, 0). We then have 


d(x) = (10.79) 


dP 
dS(0) 


-E [e max(S(T) — K, 0) LELO so | 


dS(0) 


og (39) ~(r—8—02/2T 
S(O02T 


= E | eT max(S(T) — K, 0) (10.80) 


We found an estimator for the DELTA of a European call option. An interesting remark here 
is that, unlike the re-simulation technique, this is an unbiased estimator. For a European call 
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option, we find the following estimators: 


A=E jer max(S(T) — K, 0) (log (S(T)/S(0)) — (r — 6 — 7 27)| , (10.81) 


S(0)o2T 
V=E lew max(S(T)— K, 0) (-asry E — z) ; (10.82) 
E e d(S(T))? —d(S(T))o VT — 1 
FSE £ max(S(T)— K,0) S(0202F l ; (10.83) 
p=E e” max(S(T)— K, 0) ( T+ 1cOwr)) : (10.84) 


It is possible to find estimators for other options with the likelihood ratio method. However, 
the estimators obtained with the method converge slowly. Moreover, some of these estimators 
are difficult to calculate. We would therefore like to have an easy and practical method that 
can be used for most types of options. 

Here is a MATLAB program to compute the DELTA and GAMMA of a European option 
using the likelihood ratio method. 


function rep = EurLik 
r=0.1; SRisk-free rate 
k=100; SStrike price 


sigma=0.25; Stock return volatility 
delta=0.03; Dividend yield 

T=0.2; sMaturity 

S0=110; $Initial stock price 


NStep = floor(Tx*365.25); Number of time step 
Delta 1/365.25; $Length of a step 
NTraj = 100000; $Number of paths 


sSimulation of paths 

dW = zeros (2«NTraj,1); 

S = SOxones(2«NTraj,1); 

Temp =zeros(NTraj,1); dW = cat(1,Temp, -Temp) ; 


for i=1:NStep 

Temp = sqrt (Delta) «*randn(NTraj,1); 

dW = cat (1,Temp,-Temp) ; 

S = S + S.x((r-delta)xDelta + sigmasdW) ; 
end 
End of Simulation of paths 


o 


$Computation of the values used in the estimators formulae 

d = (log(S/SO) - (r - delta - 0.5+*sigma*2)+T)/(sigmaxsqrt (T)); 
ddsig= (log(S0O) - log(S) + (r - delta +0.5s%sigma*2)+*T) /(sigma*2 
xsqrt (T)); 
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SEstimation of DELTA 
EstimDel = exp(-r«T) «max (S-k,0) .*1/(SO«sigma*2*T) .*(log(S/S0O) -... 
(r -delta - 0.5*sigma*2)+*T) ; 


Del = sum(EstimDel) /(2«NTraj) ; 
DelError = sqrt((EstimDel -Del)’+s(EstimDel -Del) /(2«NTraj))/ 
sqrt (NTraj) ; 


$Printing the result for DELTA 

fprintf ('\n DELTA of the option: %f’, Del); 
fprintf ('\n Standard error: %f’', DelError); 
%šEnd of Estimation of DELTA 


%šEstimation of GAMMA 
EstimGam = exp(-r«T) *max(S-k,0).*(d.*d - dxsigmaxsqrt (T) -1)/... 
(SO*2*sigma~2+T) ; 


Gam = sum(EstimGam) / (2«NTraj) ; 
GamError = sqrt((EstimGam -Gam)’*(EstimGam -Gam) /(2*NTraj) ) / 
sqrt (NTraj) ; 


$Printing the result for GAMMA 

fprintf ('\n GAMMA of the option: %f£', Gam); 
fprintf ('\n Standard error: %f£', GamError) ; 
SEnd of Estimation of GAMMA 


rep = 1; 


We run this program by changing the initial stock price and the simulation results are shown 
in Table 10.27. 


10.5.3 Retrieval of Volatility Method 


This method suggested by Cvitanic, Goukassian and Zapatero (2002) allows us to find the 
volatility of an entire portfolio without having to determine the volatility of all individual 


Table 10.27 Estimation of DELTA and GAMMA for a European option 


Greeks S(0) = $ 90 Error ($) S(0) = $ 100 Error ($) S(O) = $ 110 Error ($) 
A 

Exact 0.222 0.568 0.844 

Likelihood 0.221 0.0025 0.564 0.0041 0.848 0.0055 
r 

Exact 0.030 0.035 0.019 


Likelihood 0.029 0.0005 0.034 0.0007 0.019 0.0008 
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assets composing the portfolio. As usual, we assume that S(f) satisfies the equation 


dS(t) 


——— = r(t)dt + o (dW 10.85 
sO) r(t)dt + o(t)dW t) ( ) 

and the risk-free asset satisfies the equation 
dB(t) = Bi(t)r(t)dt. (10.86) 


We start by assuming that the market uncertainty is captured by a single Brownian motion 
dW (t). If P(t) is the payoff function at time ¢ (non-discounted amounts), then the value of the 
portfolio P(t) is given by 


PO) = Eek Om Bry), (10.87) 
The value of the portfolio follows a stochastic differential equation of the form 

dP(t) = a(t)dt + v(t)dW(t), (10.88) 
where v(t) is an adapted process. In discrete time, we obtain 

AP(t) = a(t)At + v(t)AW(t). (10.89) 
Hence, we have 


E[AP(t) -OAP = E [pOAW (OY 


t+At z 
~E [i voat) 
an 
= Í v(u}du 


= v(t) At, (10.90) 


2 


where the second line is obtained from the first line using the Ito isometry. From these 
calculations, we can deduce the following expression for v(t) 


2 1/2 


2 1/2 
= (zim E [a |) (10.91) 
At 


The term a(t) At disappears since its square value is negligible. A second approach allows us 
to find an expression for v(t) which is more efficient for simulation purposes. 


E[AP()AW(t)] = EKAP O) — a(thAnAW(t)] 


t+At 
=E p vadu 


xX v(t)At. (10.92) 
Thus 


Povo) 


v(t) = lim E | > (10.93) 
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We can now write an algorithm to calculate the volatility of a portfolio. 


Step 1 Calculate P(t) using Monte Carlo simulations. 
Step 2 Generate AW(t),i=1,..., NbTraj where NbTraj is the number of simulations. 
Step 3 Move one step ahead for all parameters, i.e. calculate 


S(t + At! = SA +r HAt + o(t)AW (ty) (10.94) 


and the other stochastic parameters. 

Step 4 Evaluate P(t + Ar)’ using Monte Carlo simulations (M simulations) with starting 
value S(t + Ap). 

Step5 Calculate 


NbTraj 


me 1 3 (P(t + AN- POAR 


10.95 
NbTraj = At ( ) 


With this technique, it is possible to dynamically hedge our position. We only need to short a 
number 


v(t) 


o(t)S(t) (10.96) 
of stocks S to hedge the portfolio. Indeed, we have 
v(t) rA) s 
ca sea (10.97) 


and hence we can eliminate the portfolio risk by shorting this number of stocks (because the 
stochastic parameters offset each other). 

Here is a MATLAB program to apply the retrieval of volatility method. We start by writing 
the function used to calculate the value of the call option. 


function rep=PriceCall(T,NbTraj,NbStep,S,r,sigma, DeltaT) 

Simple function to calculate the value of an European call option. 
$T: Maturity 

SNbTraj: Number paths to simulate 

sNbStep: Number of time steps by path 

$S: Initial price of the stock 

Sr: Risk-free interest rate 

$sigma: Volatility of the underlying stock price 

sDeltaT: Time step 


$Initial parameters 

K=90; 

S = Sxones(2«NbTraj,1); 

r = reones(24«NbTraj,1); 

sigma = sigmaxones(2«NbTraj,1) ; 


Loop to simulate the paths at each time step 
for i=1:NbStep 

Temp = sqrt (DeltaT) «randn(NbTraj,1); 

dW = cat(1,Temp,-Temp) ; 

S = S + S.x«(rs*DeltaT+sigma.«dW) ; 
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r= r + 0.0824%(0.06-r)*DeltaT - 0.0364*sqrt (r) .*dW; 
sigma = sigma + 0.695*(0.1-sigma) *«DeltaT + 0.21«dW; 
end 


$Calculation of the option price 
Price = exp(-rx«DeltaTs«NbStep) .*max(S - K,0); 
rep=mean (Price,1); 


end 


We can then write the rest of the program. 


function rep=Retrieval 

$Function to retrieve the volatility of a financial instrument 
S$based on the method proposed by Cvitanic et al. 

$Here, the financial instrument is a call option. 


SInitial parameters 


SO = 100; 
Sig0 = 0.1; 
RO = 0.06; 
Te 22 O03 


NbTraj=10000; 

NbStep=100; 

NbTrajSim2=100; Value of M in the theory 
DeltaT=T/NbStep; 


$Calculation of the price of the option at time t=0. 
PriceZero=PriceCall (T,NbTraj,NbStep,S0,R0,Sig0,DeltaT) ; 


SSimulation of the first time step to find the values of the option 
Sat time t=DeltaT 

Temp=sqrt (DeltaT) *«randn (NbTraj,1) ; 

dW1 = cat (1,Temp,-Temp) ; 

SOne = SO + SO.*(RO«#DeltaT+SigO.+dW1) ; 

rOne = RO + 0.0824x*(0.06 - RO)*DeltaT - 0.0364xsqrt (RO) .«dW1; 
sigmaOne = SigO + 0.695«*(0.1 - Sig0)*DeltaT + 0.21+dW1; 


$Vector to store the option prices at time t=DeltaT 
PriceOne=zeros (2*NbTraj,1) ; 


$Calculation of the prices at time t=DeltaT 
for j=1:2sNbTraj 
PriceOne (j,1)=PriceCall (T-DeltaT,NbTrajSim2, 
NbStep-1,SOne(j,1),... 
rOne (j,1),sigmaOne(j,1),DeltaT) ; 
end 


$Calculation of the volatility using the method presented. 
rep=mean ( (PriceOne-PriceZero) .+*dW1,1)/(DeltaT) ; 


end 
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Table 10.28 DELTA coefficient calculated using the retrieval of volatility method (RVM) 


T = 0.1 year T =0.3 year T = 0.5 year 

DELTA Price ($) DELTA Price ($) DELTA Price ($) 
K =$90 
BS 0.9998 10.5385 0.9887 11.6272 0.9744 12.7303 
RVM 0.9918 10.5389 1.0053 11.6282 0.9755 12.7301 
K = $ 100 
BS 0.5814 1.5794 0.6391 3.1731 0.6771 4.5027 
RVM 0.5894 1.5736 0.6358 3.1781 0.6774 4.4940 
kK =$110 
BS 0.0025 0.0023 0.0831 0.2035 0.1872 0.7022 
RVM 0.0025 0.0024 0.0831 0.1985 0.1868 0.7050 


For a call option, using Black and Scholes, we have: 


dCall  dCall 1 ə?Call dCall ~ 
dCall = So? dt SodW. 10.98 
s ( Pea aa ae ) E aoe) 
Hence, <> = ocal is the DELTA of the call option. We compare the simulations results with 


analytical values. Table 10.28 compares the values obtained using the Black and Scholes 
formula (BS) and the results obtained using the retrieval of volatility method (RVM) described 
above in a constant interest rate and volatility regime. 

The advantage of this technique is that it is always applicable no matter what the number 
of stochastic parameters. For example, we run simulations with several stochastic variables. 
Table 10.29 exhibits the simulations’ results of the retrieval of volatility method (RVM) with 
stochastic interest rate and stochastic volatility. 


Multi-factors 


The retrieval of volatility method can also be applied when there are two sources of uncertainty 
in the economy. Assume we have 


dS;(t) = SOC dt + on YAW (0) + ond), (10.99) 


Table 10.29 DELTA coefficient with stochastic interest rate and stochastic volatility 


T =0.1 year T = 0.3 year T =0.5 year 
DELTA Price ($) DELTA Price ($) DELTA Price ($) 
K =$90 
RVM 1.0049 10.5325 0.9626 11.6146 0.9548 12.6831 
K = $ 100 
RVM 0.7101 1.5501 0.7936 3.1465 0.8230 4.5295 
K =$ 110 


RVM 0.0693 0.0787 0.3336 0.8989 0.5006 1.9349 
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for i = 1, 2. Thus, the portfolio P composed of many financial instruments have the following 
process 


dP(t) = P(t)(r(t)dt +v Od WO + v2(t)dWo(t)). (10.100) 


We can estimate 


N 


vi(t) = pe (P+ A — O TOL 


(10.101) 


and, to replicate the dynamic of our portfolio, we need to invest —M$ in the first asset and 
—M>$ in the second asset, where 


M,\ — font) opt) ma Ant) 
Gee 2S) (oy). (10.102) 


Example 5.2 For example, if we use this method with two underlying assets and the 
following parameter values: $;(0) = S2(0) = $ 100, K = $ 90, T = 0.1, r = 0.06, 
0 11(0) = 0.1, o 12(0) = 0.2, o2;(0) = 0.3 and 7 22(0) = 0.1, we find the approximation 


18.5289 
p= Gane): (10.103) 
We then have 
M,\ _ ( 19.50 
A) = (a220) ` (10.104 


So we need to short 0.1950 stock Sı and 0.8289 stock Sz to hedge our position. 


Notes and Complementary Readings 


To complement the topics covered in this chapter, the reader will find the following books 
useful: Clewlow and Strickland (1998), Glasserman (2003), Jackel (2002), Tavella (2002). 

Regarding the classics, see Boyle (1977) for the pricing of European options using Monte 
Carlo simulations and Barraquand (1995) for the quadratic resampling. 

The articles by Boyle, Broadie and Glasserman (1997) and Fu, Laprise, Madan, Su and Wu 
(2001) provide good summaries of Monte Carlo simulation techniques. 

For simulation techniques developed to price American options, see Broadie and Detemple 
(1996) for a general overview, Barraquand and Martineau (1995) for the dynamic programming 
with Stratified States Aggregation technique, Longstaff and Schwartz (2001) for the Least- 
Squares Method. 
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For numerical evaluation methods of Asian options with control variables, see Kemna and 
Vorst (1990), and for barrier options with extreme values simulation technique, see El Babsiri 
and Noel (1998). For treatments on exotic options, see Jarrow (1995), Clewlow and Strickland 
(1997a), Zhang (1998). 

To estimate options sensitivity coefficients (or the Greeks), see Broadie and Glasserman 
(1996) and the retrieval of volatility method (RVM) of Cvitanic, Goukassian and Zapatero 
(2002). 

For references on volatility models, the reader can consult the articles by Poon and Granger 
(2003, 2005) and the books by Gatheral (2006), Rebonato (2004), Poon (2005) and Javaheri 
(2005). 
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Term Structure of Interest Rates 


and Interest Rate Derivatives 


To compute the present value of cash flows, one requires interest rates. However, interest rates 
are far from being constant, which makes them very difficult to analyze. In the last decades, 
we have witnessed the development of many models that depict the evolution of interest rates 
in financial markets. Generally, the curve representing the evolution of interest rates obtained 
from zero coupon government bonds with different maturities constitutes what we call the 
term structure of interest rates. An interest rate derivative is a contingent claim having as 
underlyings the interest rates or different assets whose prices depend principally on interest 
rates. 

Term structure of interest rates and interest rate derivatives’ models can be classified into 
two categories: arbitrage-free models and general equilibrium models. In this chapter, we 
present some main classical interest rate models as well as their uses in the valuation of bonds 
and interest rate derivatives. We begin by first presenting the general modeling approach for 
the term structure of interest rates as of Vasicek (1977), then we review the general equilibrium 
approach of Cox, Ingersoll and Ross (1985b) and the affine model proposed by Duffie and Kan 
(1997). Market models of Heath, Jarrow and Morton (1992) and Brace, Gatarek and Musiela 
(1997) are also presented. These models are typical of those used for interest rates modeling 
in continuous time. 

Since the mathematical complexity and the tediousness of the derivation of the models 
presented here will take us away from the objective of this chapter, which consists merely of 
providing the economics and analytical approaches of the topics covered, we will skip most 
details of the derivations of the results. The interested reader can consult the list of references 
provided at the end of the chapter. 

In providing a useful framework for pricing bonds and interest rate derivatives, when 
necessary we make the link between the real world probability and the risk-neutral pricing by 
means of the market price of risk discussed in the previous chapter. 

Since the market models are popular among practitioners, we spend more time on these 
models for which we propose more detailed MATLAB® programs. We must point out that 
the models summarized in this chapter are far from being exhaustive, and the reader looking 
for more breadth and depth could consult the references given at the end of the chapter. 


11.1 GENERAL APPROACH AND 
THE VASICEK (1977) MODEL 


11.1.1 General Formulation 


Let P(t, T) be the value at ¢ of a bond paying $1 at time T, that is P(T, T) = 1. 
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Definition 1.1 The yield to maturity R(t, T) of the bond is the discount rate that makes the 
present value of the cash flows equal to the bond price at time t, and it is obtained by the 
following formula: 


P(t, T) =e ODT), (11.1) 
which implies 
In(P(t, T 
R(t,T) = -aeu (11.2) 


The function R(., T) can be used to describe the evolution of the interest rate of zero-coupon 
bonds with respect to maturity or the term structure of interest rates. 


How many economic state variables do we need to generate R(t, T), for all t? Intuition tells 
us that few state variables are needed if adjacent points are highly correlated. Vasicek (1977) 
proposes only one state variable. This variable is the risk-free interest rate r(t) defined as: 


r(t) = RE, t) = lim R(t, T). (11.3) 


We suppose that r(t) follows the process 
dr(t) = u(r, thdt + o(7, t)dW (t), (11.4) 


where W(t) is a Wiener process. 
Today’s price P(t, T) (at time ft) of $1 payable at time T is only a function of time ¢ and the 
spot interest rate r. 


Remark 1.2 This condition means that bonds’ instantaneous returns for different maturities 
are perfectly correlated. 


Therefore, from Ito’s lemma, we can express a bond price as follows: 


10°P 
2 ðr? 


P 
dP(t, T) = E ae p 
a 


5 (dre 


ip 
ot 


aP əP 1 Sie 
= | — EAK § an | dt 
(Zu + ap a rD 2) 


oP 
aa t)dW(t) (11.5) 
7 
or also 


dP(t,T)= P(t, T)[up(t,T,r)dt + op(t,T,r)dW(t)], (11.6) 
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with 


p a 92 
P ur, t) t+ Ë + holt, 1° 24 
P(t,T) 3 


upt, T,r) = (11.7) 


aP 
FIr t) 


op(t,T,r) = PET) 


(11.8) 


We consider a portfolio composed of two bonds with maturities Tı and T2, and proportions 
(œw, 1 — w) invested respectively in the two bonds. The return on this portfolio is 


olup(t, Tı, r)dt + op(t, Tı, r)dW(t)] 
+(1 — @)[up(t, To, r)dt + op(t, To, r)dW(t)]. (11.9) 


We choose w such that the portfolio is riskless, this implies 
oup(t,T,,r) +A — opt, In, r) =r(t) (11.10) 
and 
wop(t,7T,,r) + — w)op(t, To, r) = 0. (11.11) 
So, 


Me(t,Ti,r)—r(t) — wp(t, To,r)— rt) 
op(t, T\,r) E op(t, T2,r) 


(11.12) 


This equation is called the fundamental equation. Since this result is valid for all Tı and T>, 
then it is independent of T. Finally, we get 


(up(t, T,r) —r(t)) fa: : 
pare ee R (11.13) 


representing the market price of risk. Substituting this expression into equation (11.7), we get 


oP 
ot 


OP” 1 ie a 
+ (ue, +00, Dat, r) =— + 500, t) > =r O)PE,T), (11.14) 
ðr 2 ðr 


with the boundary condition P(T, T) = 1. This equation is the basis equation of the term 
structure of interest rates. 
The general solution of this equation is 


T T T 
P(t,T)= E| exp (-f roar) exp (/ q(t, r(t))dW (tr) — >| araar) |, 


(11.15) 


and 


R(t,T) = ae. (11.16) 
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11.1.2 Risk Neutral Approach 


Here we provide an alternative derivation of the results of the previous section. 
In the absence of arbitrage opportunities, there exists a risk neutral probability 


T T 
dQ = exp (J q(t, r(t))dW(r) — al q(t, roar) dP (11.17) 


such that 
dW (t) = dW (t) — q(t, r)dt (11.18) 
is a Brownian motion under this probability Q. In this case: 
dP(t,T) = P(t, T)[uwe(t, T,r)dt +op(t,T,r)dW(t)] 
= P, T) [rdt + op(t,T, dÑ O]. (11.19) 
Combining equations (11.18) and (11.19) shows that: 


(up(t, T,r) —r(t)) K ; 
opt, T.T) =q(t,r), YT >t. (11.20) 


Note here that q(., .) representing the market price of risk is a function of many variables such 
as the state variable r, the time ¢ and the time to maturity T — t, and then risk-neutral pricing 
does not necessarily imply q equal to zero. Sometimes one may set q equal to zero or constant 
for tractability. 

The price of the bond under the risk neutral probability Q is 


T 
P(t, T) = E? [exv (-f roar )| 
7 T 1 T 
= E| exp (- / roar) exp (/ q(t, r(t))dW(r) — Al q(t, rodz) | 


(11.21) 


11.1.3 Particular Case: One Factor Vasicek Model 


The general model doesn’t help us very much. We consider the particular case where the 
risk-free interest rate follows the Ornstein-Uhlenbeck process: 


dr(t) = aly —r(t))dt + cdW(t) (11.22) 


and the market price of risk q is set constant. In this process, y represents the value towards 
which r(f) is attracted: this property is called mean reversion. In this context 


T T 1 fT 
PT) = E| exp(— f rede) exp f awo- 5 | rar) | (11.23) 


Considering the Brownian motion under the risk neutral probability Q, 


dW(t) = dW(t) — qdt, (11.24) 
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we obtain the following process for the interest rate 
dr(t) = (ay + oq —ar(t))dt + odŵ (t). (11.25) 


We see that r(t) is normally distributed under probability Q. Thus 


T 
P(t,T) = EÊ [exp (-f roar )| 
i 1 T 
= exp (2° [-f roar] + zvar? [-f roar) 


1 2 
= exp (<0 — e *F)(R(oo) — r(t)) — (T — t)R(oo) — atl = oy) , 


(11.26) 
with 
oq o? 
R(oo) = y ~~ 5. (11.27) 
Thus 
iTe e T) 
1 
= R(0o) + (10) — ROO r p e + Fae a ue: 
(11.28) 


The term structure yield curve can be flat, increasing or decreasing depending on values of 
r(t) and other parameter values. We present the MATLAB code for such a simulation as well 


as a possible trajectory in Figure 11.1. 


R(t,T) 


Figure 11.1 Simulation of the one factor Vasicek model 
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function VasiceklFactor 
$Function simulating the term structure using 
S$Vasicek model 


SParameters initial values for one factor Vasicek model 
alpha=1.3; 

gamma=0.05; 

sigma=0.06; 

q==0 1; 

Maturity=3; 

rinit=0.05; 

NbStep=1000; 

DeltaT=Maturity/NbStep; 


SInterest rates simulation 
r=SimulRates (alpha, gamma, sigma, q, DeltaT,NbStep,rInit); 


%Parameters of model to rate R. 
RInfinity=RInf (alpha, gamma, sigma, q); 


%Vector of time steps 
t=[0:1:NbStep-1] *DeltaT; 


$Computation of rates for the points represented in vector r 
R=RInfinity + 
(r-RInfinity) .* (1-exp(-alpha* (Maturity-t)))./(alpha* 
(Maturity-t))+ 
sigma*2.* ((1-exp(-alpha* (Maturity-t))).%2)./(4*alpha*3* 
(Maturity-t)); 


$Graph of the interest rate as function of maturity 
plot ([t,Maturity], [R,rInit]); 


end 


function rep=RIinf (alpha, gamma, sigma, q) 
$Simple function computing parameter R(infinity) of 
$Vasicek model 


rep=gamma-sigma*q/alpha-sigma*2/(2*alpha~2) ; 
end 


function rep=SimulRates (alpha, gamma, sigma, q, DeltaT,NbStep,rInit) 
$Fonction simulating the interest rates for Vasicek model 
Salpha, gamma, sigma, q: Model parameters 

SDeltaT: Time step 

SNbStep: Number of steps 

$rInit: Initial value of interest rate at time t=0. 


SInitialisation of the answer vector 
rep=zeros (1,NbStep) ; 
rep(1)=rInit; 


$For each step, we simulate the rate following the model 
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for i=2:NbStep 
rep (i) =rep(i-1)+alpha* (gamma-rep(i-1))*DeltaT+ ... 
sigma*sgqrt (DeltaT) *randn (1) ; 
end 


end 


11.2 THE GENERAL EQUILIBRIUM APPROACH: 
THE COX, INGERSOLL AND ROSS (CIR, 1985) MODEL 


With the Ornstein-Uhlenbeck process proposed by Vasicek for the short term interest rate, 
negative values for r can result. However, based on a general equilibrium model, Cox, Ingersoll 
and Ross (CIR, 1985a) propose a one-factor model as follows: 


r(t)=6x Y(t), (11.29) 


where the short term interest rate is proportional to a stochastic state variable Y. Under a 
particular process for Y (t), the short term interest rate process can be characterized as: 


dr(t) = K(0 —r(t))dt + oyr OAW (H) (11.30) 
with 
2x0 > 0°, Vt € [0, T]. (11.31) 


Without the restriction (11.31) on o,r can reach zero. Note that in equation (11.30), the form of 
the volatility term prevents negative rates, which is not the case with the Ornstein-Uhlenbeck 
process suggested by Vasicek. 

Comparing equations (11.4) with (11.30) and then substituting the values of u(r, t) and o (r, 
t) into (11.14), assuming again that P is a function of r and t, the bond’s price must satisfy the 
following partial differential equation: 


va Th -) o ee -P =0 (11.32) 
27 y ar2 K T y 3r at y = U, i 


with the market price of risk q given by 


q(t,r) = = ro. (11.33) 


o 


or 


q(t, r)o 
vr (t) 
The constant A is the ‘market’ risk parameter. These specifications allow us to reconcile the 


valuation partial differential equation (11.32) and equation (11.14). 
Finally, it can be shown after lengthy calculations that the solution of equation (11.32) is 


A= 


. (11.34) 


P(t, T) = A(t, Te” OO, (11.35) 
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with 
Dy elle +atyNT—1/2 216 [o? 
AG, T)= | | (11.36) 
(y +e +AT — 1) 4+ 2y 
2X(eYT-9 _ 1 
H@,T)= v ) (11.37) 


(y+tK+A(erE—-9 — 1) 4 2y’ 


y =Vk +A)? +202. (11.38) 


Thus, the term structure is given by 
In(P(t,T)) — rH, T) — At, T) 


11.39 
T-t T-t ) 


R(t, T) = 


and 
2K0 


ari (11.40) 


R(t, œ) = 
When r < R(t, oo), the term structure yield curve is increasing, and when r > rood it is 
decreasing. For values of r between these two bounds, the curve isn’t monotonic. With regards 
to the market price of risk g, Chan, Karolyi, Longstaff and Sanders (1992) estimated empirically 
this parameter for both Vasicek and CIR models and found negative values for q. 
We once again present a program simulating a possible trajectory for the term structure. 
One trajectory is plotted in Figure 11.2. 


function CIR 
$Function simulating the term structure from the CIR model 


%Parameters’ initial values 
Theta = 0.05; 
Kappa = 1.3; 
Sigma = 0.2; 


R(t,T) 


Figure 11.2 CIR model simulation 
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Tesz 

NbSteps = 156; 

DeltaT = T / NbSteps; 

riInit = 0.05; 

Lambda = 0.3; 

Gamma = sqrt ((Kappa+Lambda) *2+2*Sigma”2) ; 
b= 0%; 

r=rinit; 


sDefinition of vector R to keep the simulations results 
R = zeros (NbSteps,1); 


SFor each step, we simulate the term structure for time t 
for i=1:NbSteps 


sComputation of numerators and denominators in formula 
of A and H 

Temp1= (Gamma+Kappa+Lambda) * (exp (Gamma* (T-t) ) -1) +2*Gamma; 

Temp2=2*Gamma*exp ( (Gamma+Kappa+Lambda) * (T-t) /2) ; 

Temp3=2* (exp (Gamma* (T-t))-1) ; 


SComputation of A and H 
(Temp2/Temp1) ^ (2*Kappa*Theta/Sigma”~2) ; 
= Temp3/Temp1; 


T > 
l 


SRates’ simulation 
r= r + Kappa*(Theta-r)*DeltaT + Sigma*sqrt (r) *sqrt 
(DeltaT) *randn (1); 


$Term structure computation at this time 
R(i,1) = (r*H-log(A))/(T-t); 
t=t+DeltaT; 

end 


sGraph of the term structure 
plot ([DeltaT:DeltaT:T], R); 


end 


11.3 THE AFFINE MODEL OF THE TERM STRUCTURE 


In the absence of arbitrage opportunities, the price at time t of a zero coupon bond maturing 


at T is given by the following formula: 
PQ,t+1)= EP [e fe foa], 


where E is the risk neutral probability Q expectation operator. 


(11.41) 


From the Cox, Ingersoll and Ross (CIR, 1985a) general equilibrium model, or the Heath, 
Jarrow and Morton (HJM, 1992) arbitrage model, described in Section 11.4, we can deduce 
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a multi-factor linear model for the short term interest rate. Indeed, we can posit an N factor 
affine model for the term structure under the assumption that the short term interest rate r(t) is 
an affine combination of N state variables Y(t) = (Y 1 (t), Yo(t),..., Yy(t)). Thus, we can write 
the short term interest rate as: 


N 
r(t) = 89 + 5 SiYi) = ôo + 8) Y(t), (11.42) 


i=1 
with the vector Y(t) having an “affine diffusion” that is defined as: 
dY(t) = K0 — Y(t))dt + XUVS()dW (t), (11.43) 


where W is a vector of independent Brownian motions under probability Q, K and È are 
matrices of dimensions N x N, and S is a diagonal matrix whose element on the i diagonal 
is 

[SO]; = a; + b YO. (11.44) 


With these specifications, Duffie and Kan (1996) proved that the bond price given in (11.41) 
can be written as: 


P(t,t +1) = P, T) = e^ OO | (11.45) 


where A(t) and H(t) satisfy the following ordinary differential equations: 


N 


dA(r) 1 
= 6'K H(t) + 5 Da Lin: (11.46) 
dH(r) ia 
7 = —K' H(t) — ; D: O + ôy. (11.47) 


{=l 


These ordinary differential equations can be solved using numerical integration techniques 
with initial conditions A(0) = 0 and H(0) = Oyx1 and the specifications given by equations 
(11.41), (11.42), (11.43) and (11.44). 

The term structure is given by 


In(P(t,T)) HTT -HYG — AC — t) 
Taps T-t i 


R(t,T)= (11.48) 
11.4 MARKET MODELS 
11.4.1 The Heath, Jarrow and Morton (HJM, 1992) Model 


This model was developed by Heath, Jarrow and Morton (HJM, 1992). It uses forward rates 
to generate the complete term structure of interest rates. 

We consider a continuous time economy on interval [0, T]. The zero coupon bond price at 
t paying 1$ at future date T is P(t, T), with 


P(T,T)=1. (11.49) 
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The instantaneous forward rate, f, at time ¢ for maturity T is 


B ð ln(P (t, T)) 


t,T)= 11.50 
fE, T) IT ( ) 
or also 
T 
P(t, T) = exp (-f fesas) : (11.51) 
t 
Term Structure Movements 
We consider the following process for forward rates 
t n t 
FOD = 0.7) fais. Das+Y> | os, Daw), (11.52) 
0 = J0 


where {W;};=1,...,n are independent Brownian motions. The spot interest rate at date r, r(t), 
can be seen as being the instantaneous forward rate at t, that is r(t) = f (t, t), hence we can find 
the spot rate process as follows 


t n t 
rt) = ft.) = FO. + f als, nds + Y` / oi(s, t)dW,(s). (11.53) 
0 ia] 00 
From equation (11.51), we have that 


T 
In(P(t, T)) = -f f(t, s)ds. (11.54) 


Using equations (11.53) and (11.54) in conjunction with a key corollary provided in the 
appendix of Heath, Jarrow and Morton (1992), and after tedious algebra, we get 


In(P(t, T)) = In(P (0, T)) + i (r(s) + b(s, T))ds 
0 
yf ails. Tas +Y f ai(s, T)dW;(s), (11.55) 
i=1 79 i=1 Y0 
with 
T 
at T=- | o;(s, T)ds (11.56) 
and 


bt, T) = -f a(t, s)ds + 5 att, TÈ. (11.57) 
i=1 
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Using Ito’s formula, we find the following process for the bond: 


dP(t, T) = (r(s) + b(s, T)) Pt, T)dt + X ailt, T)P(t, T)dWi(0). (11.58) 


i=1 


Valuation in the absence of arbitrage opportunities and the market price of risk 


Let’s define B(t) as being the value at time t of $1 invested at time 0 


B(t) = exp (f rows) ; (11.59) 
0 


Then we build the relative price process of the zero coupon bond with maturity T by 


EG) 11.60 
BO (11.60) 
Applying Ito’s formula to In(P(t, T)/B(t)) leads to: 
POA). (POT) : IAF o a 
in( ae ) =1n( Oo )+f b(s, T)ds 22 ai(s, Tds 
+> a;(s, T)dW;(s). (11.61) 
i=1 “0 


The price variables {P(t, T1)/B(®), . . . , P(t, T,)/B(); 0 < Tı <... < Ta} are martingales if and 
only if there exists an equivalent martingale measure Q, which means the existence of market 
prices of risk 


{gi(ts Tı, edhe Tn)} 


ie[1,n] 


satisfying the following system of equations: 


0 ait, Tı) ... ant, Ti) q(t; Ti, ..., Tn) b(t, Ti) 
: |= : : +] :  [-d162 
0 ay(t, Tn) see an(t, Tn) Qn(ts Ti, ---, Tn) b(t, Tn) 
So 
bt, T) = — X ait, T)gi(tsTi,-.-. Tn). O<t<7T, <...<T7, <T. (11.63) 


i=1 


Using the Girsanov theorem, we can prove that under risk neutral probability Q defined by 


n fi 1 n Tı 
dQ = exp (of gi(t;T1,..., Tn)dWi(s) — z > qit; Ti,- nas) dP, 
1 1 


(11.64) 
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processes 
ss t 
Win = win - f qit; Ti,- Tds,  i=1,...,n, (11.65) 
0 


are Brownian motions. The risk neutral probability uniqueness implies that matrix 


a(t, Tı) ... ant, Ti) 
: (11.66) 
a(t, Ta) AS an(t, Ta) 


is nonsingular. 
Heath, Jarrow and Morton (1992) demonstrated the following proposition essential to asset 
valuation under the arbitrage-free argument. 


Proposition 4.1 The following conditions are equivalent 


1. Q is the unique equivalent martingale measure such that P(t, T )/B(t) is a martingale. 
2. The arbitrage-free valuation condition is 


Pie T= qT. TD, i=1,...,n, (11.67) 
Y Ti... Tn, Ti, ..., T}, with 
O<Ti<...<T, <T and 0<T/ <...<T <T. (11.68) 


3. The restriction on the drift of the forward rate process is 
n T 
a(t,T)=—) ot, T) (a0 -f o;(t, sds (11.69) 
i=l i 


with qi(t) = qilt; T1,..., Tn). 


With condition (11.69) of the above proposition, we can show that the bond price is defined 
by 


dP(t,T) = P(t, T)| rae + DD aoao], (11.70) 


where W; is Wiener process under the risk-neutral probability. Following Harrison and Kreps 
(1979) and Harrison and Pliska (1981), given a financial asset X paying $ 1 at date T, its price 
at any moment ź (t < T) can be obtained using the risk neutral probability Q as follows: 


X(T) 


— FQ 
ss alae Ea 


g x Bt). (11.71) 
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Using this expression, we obtain the zero coupon bond price 


F. 


P(t,T)= E? |e- JE rods 
B(t) 
EERE n 2 
= Ee F.| 


exp ( ae fi qi(t)dW;(t) — 5 Xia ie gi(tdt) 
BT) 


= BE 


|F, (11.72) 


Remark 4.2 In this last expression, the bond’s price depends on the value of the market 
prices of risk {g;(¢)} under the real world probability, and 1/B(T), the discount factor. These 
two quantities contain: the forward rate {f(0, T)}’s initial term structure, the forward rate’s 
drift {a(., T)} and the forward rate’s volatility {0;(., T), i = 1,.., n}. 

The market prices of risk will allow us to make the bridge between the real world probability 
and the risk-neutral pricing. 


The HJM model allows us to derive many other interest rate models using specific calibration 
of forward rates’ process. Here are some of these. 


The Ho and Lee (1986) Model 


To have the Ho and Lee (1986) continuous time term structure model, we use the following 
formulation for the forward rate: 


t 


fet) = 0.7) + f ats, Dds + | odW (s), (11.73) 
0 0 


with œ satisfying the arbitrage-free condition defined by the following restriction on the 
forward rates’ parameters as follows: 


a(t, T) = —og(t)+o°(T — t). (11.74) 
The forward rate formulation (11.73) is a special case of (11.52) with one factor and constant 


o, therefore (11.74) follows from (11.69). Then, under risk neutral probability Q, we can 
rewrite the forward rate’s process as: 


t ag 
f(t,T) = f,T) +0? (r — 5) +oaW(t) (11.75) 
and the spot interest rate’s process 
~ t 
r= fO,T)+oWO+0°>. (11.76) 


We must note that from these two processes, r and f can be negative with strictly positive 
probability. 
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Equations (11.75) and (11.51) give the following zero coupon bond’s price 


pot) = 1 OP Dm t T nivo) 11.77 
(t, = SE evn ( 5 ( )-a( W). (11.77) 


Now consider a European call on this zero coupon bond, P(t, T), with strike price K and 
maturity T’ with 0 < t < T’ < T. From equation (11.71), we can deduce the call price as 


max(P(T'’',T)— K, 0) 
B(T’) 
= P(t, T)N(h)— KP, T’)N(h—o(f —T')VT' —t), (11.78) 


Call(t) = zef BOJE] 


with 
In (et) F z0 (T -T Sa) 
h= l ; (11.79) 
o(T —T')J/T'—-t 
The Two-Factor Model 
Now consider the following two-factor forward rate process 
df(t, T) =alt,T)dt + ojdW(t) +02e7?7-dWy(t), (11.80) 
— a 


long term factor short term factor 


where W, is the long term factor and W3 is the spread between the short and long term factors. 
In this case, the arbitrage-free condition satisfied by the forward rate’s parameters is 


À 2 a a 
a(t, T) = —o\9qi(t) — 2e? q(t) + of (T — t) — T a -1). 
(11.81) 


Under the risk neutral probability, the forward rate’s process is 


t 2 i apy dy 
f,T) = fO,T)+07t (7 -5)-2(2) er" ee’ —1)— 2e 27 (e? -1)| 


t 
+o,Wi(t) +02 f eT a Wo(s) (11.82) 
0 


and the spot interest rate is 


rit) = fO, n+ors (=). [a eo) 2(1 et] 


t 
+o Wilt) +o f e72 -9dW(s). (11.83) 
0 
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In this case, the price of a call having as underlying the zero coupon bond P(t, T) with strike 
price K and maturity T’ with 0 < t < T’ < T, is given by: 


Call(t) = P(t, T)N(h) — K P(t, T')N(h — v), (11.84) 
with 
In (ED) + 1v2 
h (Ere) 2 (11.85) 
v 
and 
2 2 2er! 403 ay -ATARA aT’ Mt 
vy Set aE =N (e727 — e727 V (eT — e). (11.86) 
The CIR Model Revisited 


In section 11.2, we presented the Cox, Ingersoll and Ross (CIR, 1985) model. Here, the 
objective is to show how one may obtain the CIR general equilibrium model as a special case 
of the HJM arbitrage-free model with a specific forward rate process. 

Recall that the term structure model proposed by Cox, Ingersoll and Ross (CIR, 1985) 
follows from a general equilibrium model. In the one variable case, CIR obtain the following 
term structure process: 


dr(t) = KO) — r(t))dt + oyr OAW (t), (11.87) 
with 
2xO(t) > 0°, vt € [0, T]. (11.88) 


Which implies a bond price process of the following form 
dP(t, T) =r [1 — H(t, T)]P(t, T)dt — H (t, Tjo yr OP (t, T)dW (t), (11.89) 
where 


XerT-dÐ — 1) 
(y +e +ANerT-d — 1) +2y’ 


y=V(k +A) +20?, (11.91) 


H(t,T) = 


(11.90) 


and A is constant. 
In this context, we showed that the market price of risk is given by 


AVr(t 
ieee. (11.92) 
o 
From the bond’s price process, CIR deduce the following forward rate process: 
dH(t, T) 7 dH(s,T) 
tT) =r(——) f (s)(—— Jas. 11.93 
f,T) =r) aT +K ! (s) aT s ( ) 


CIR suggest to reverse this equation to implicitly determine the spot interest rate parameters. 
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Remark 4.3 The CIR model is not coherent with all initial forward rate curves since we must 
have 2k 0(t) > a7. 

However, from the HJM model, to obtain the term structure of interest rates proposed by 
CIR, we start from the following forward rate process 


3H(,T) dH(t,T dH (t,T 
df(t, T) =r ( — £: ar + ( = Ve VrdW 0. (11.94) 


with 


[re D- wf? 0(8)(#42)as] 
r(t)= dHC.T) ; (11.95) 
tee) 


Remark 4.4 The CIR model fixes a specific market price of risk and endogenously derives 
the forward rate process; the HJM model takes a given stochastic process for the forward rate 
and prices the contingent claims from this process. The HJM model is more general than the 
one obtained from an equilibrium model such as CIR. 


11.4.2 The Brace, Gatarek and Musiela (BGM, 1997) Model 


We present next a model known as the Brace, Gatarek and Musiela (BGM 1997) market model. 
The principle consists of describing the market rates process from an already existing model 
such as the one of Heath, Jarrow and Morton (1992). 

BGM describe the LIBOR rates’ process observed on swaps’ markets with the HJM model. 
They then define the different conditions satisfied by these processes. 


The HJM Model Revisited and Notations 


Let f(t, t + x) be the instantaneous continuously compounded forward rate at t for future date 
t + x. Thus for all T > 0, the process 


T 
P(t, T) = exp (-f fa, udu) , O<t<T (11.96) 


describes the price evolution of a zero coupon bond with maturity T. The temporal evolution 
of the discounting function is 


t+x 
xh P(t,t+x)=exp (-f flaw). (11.97) 
t 
We consider the following forward rate process: 


of (t,t +x) a 1 dllo(t, t +x)||? 
Ox 2 Ox 


2 
df(t,t +x) = ( a F an, (11.98) 
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where for all x > 0, the volatility vector process {ø (t, t + x); t > 0} is an adapted process with 
values in R?, while ||-|| stands for the norm. This process of the forward rate is consistent with 
the free-arbitrage conditions in the HJM model. The process: 


t 
B(t) = exp (/ rods) , t>0 (11.99) 
0 
is the accumulated value at t > 0 of one dollar invested at time 0. We know that 
{P(t,T)/BA, O<t<T} (11.100) 


is a martingale, which implies the no arbitrage possibility. With these definitions, we can show 
that 


ED P(O, T)ex (- foe T)dW(s) — sf lo(s Pas) (11.101) 
B er e a aaa 
This implies 
dP(t,T)= Pt, T )(r(t)dt —o'(t, T)dW(t)). (11.102) 


This can be related back to equation (11.70). 


The LIBOR Rate Process 


To characterize the volatility o(¢, t + x) process in the above forward rate equation (11.98), 
we suppose the time period 6 > O (for example ô = 0.25 year or one quarter). ô repre- 
sents the payment frequency of the LIBOR rate. We then consider the LIBOR rate process 
{LIBOR(t, x); t > 0}, defined as: 


1 + ôLIBOR(t, x) = exp (f f(t,t+ udu ) ; (11.103) 
This process has a log-normal volatility structure, that is: 
dLIBOR(t, x) = (-)dt + LIBOR(t, x)y | (t, x)dW (t). (11.104) 
Ito’s lemma applied to expression (11.103) gives: 
1 x+ô 
dLIBOR(t, x) = 54 exp (J f(t,t+ udu ) 


ƏLIBOR(r, 1 
=| Eee 5 (1 + SLIBOR(, x))o T(r, t +x + 8) 


Ox 


x (o(@,t+x+5)-—o(,t+ D) fa 


+i + ôLIBORG, x)) (o (t,t +x +8) — o(t,t + x))'dW(t). (11.105) 
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Comparing this expression to equation (11.104), we have 


SLIBOR(t, x) 
1+ sLIBOR(, x)” 


o(tt,t+x+6)—o(t,t+x)= (t,x). (11.106) 


So the expression of LIBOR(f, x) becomes 


dLIBOR(r, x) 
Ox 
+ LIBOR(t, x)y T (t, x)dW(t). (11.107) 


dLIBOR(t, x) = ( + LIBOR(t, x)y (t, o (t, t +x + D )ar 


From equations (11.106) and (11.107), we can show that {LIBOR(¢, x); t > 0} must satisfy: 


ƏLIBOR(t, x) 
Ox 
LIBOR? (t, x) 

1 + ôLIBOR(t, x) 


dLIBOR(t, x) = ( + LIBOR(t, x)y T(t, x)o(t,t +x) 


lly, wn? )at + LIBOR(t, x)y | (t, DAWO), 


(11.108) 
with o and y linked by the relation defined by the following equation: 
0 if O<x <6 
o(t,t+x)= | MCE) SLIBOR(,x—kò) T ee ee ; (11.109) 


k=1 THSLIBOR(,x—k5) Y 


where Int(+) is the integer part of 5. 

We have described modern modeling approaches for the term structure of interest rates 
and also summarized the most widely known models used by practitioners. In the following 
section, we propose simulation programs to illustrate the implementation of the BGM market 
model. For this purpose, we provide programs to evaluate interest rate derivative products 
such as caps, floors and swaptions. The reader could start from these basic programs to make 
extensions into more sophisticated market models. 


Pricing Interest Rate Caps and Floors 


A cap is a call option on an interest rate or, equivalently, a put option on the bond’s price. A 
floor is a put option on the interest rate or, equivalently, a call on the bond’s value. 
We consider a cap having a strike price of K, resetting at dates Tj = To + jAt, j=1,...,n, 
on a principal of $ 1. For any principal A, the value will be multiplied by the principal value A. 
The cap is a portfolio of n (j = 1, . . . , n) caplets. The LIBOR rate received at date T; is fixed 
at T;_; from equation (11.103) at level 


1 
LIBOR(T;-1, 0) = = (P(Tj-1,7))' — 1). (11.110) 


The (j — 1)” caplet gives at date Tj a payment equal to: 


At max (LIBOR(T;-1, 0) — K, 0). (11.111) 
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The cap’s price is obtained at t < To by taking the expectations under the risk-neutral measure: 


_ poly BO Kyt 
cap(t) = E 3 pap PRI 0) — KY At|F, 
= > P(t, TEF [(LIBOR(T;-1, 0) — K)* At|F;]. (11.112) 


j=l 


Theorem 4.5 BGM show that the cap’s price at date t < To is obtained by the following 
formula 


cap(t) = >a AtP(t, T)) LIBOR(, Tj-1 — t)N (A(t, Tj-1)) 


j=l 
-KN (h(t, Tj-1) — oj VT) =1)|, (11.113) 
with 
[in (LIBOR(t, T)-1 — t)/K) + $02°(T 1 — | 
Cay ae (11.114) 
i oj/Tj-1 —t 
and 
2 1 ne 2 
o= zo) lys, T)-1 — s)|I?ds. (11.115) 


We implement this approach to evaluate a cap. First, we get the P(O, T;) from different 
LIBOR rates observed on markets using linear interpolation. We suppose in the program these 
values are known. Second, we specify a function y(t, x) = y(x). For this, we consider a 2 
dimensional function y. This function is constant over the intervals of length At = 0.25. Here 
is the program: 


function AnsBGM=CapBGM 

$Function computing a cap’s value from the BGM model. 
$Strike: The strike price. 

Strike=0.03; 


$Initial parameters 
NbStep=20; 
DeltaT=0.25; 


SSpecification of a gamma function 


Gamma=[0.00, 0.00; 0.09, 0.12; 0.08, 0.05; 
0.08, 0.05; 0.23, 0.09; 0.23, 0.09; 
0.19; 05037 0.19, 0..035-.0.08,. 0.201; 
0.08; 05027 OTI O01 0.095 0. 015 
Oi 


14, -0.01; 0.14, -0.01; 0.14, -0.01; 
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O014, =o OT 0009, 0057 0.09; =0..05.; 
0.09, -0.05; 0.09, -0.05; 0.14, -0.04]; 


$Forward rates’ specification 

P=[1; 0.9830; 0.9652; 0.9464; 0.9271; 0.9076; 
0.8882; 0.8688; 0.8497; 0.8308; 0.8123; 0.7942; 
OSTIA “OS T5967 OC 74355 0.72743 (O27 T1123 0.6954; 
0.6797; 0.6644; 0.6491]; 


Vector storing the values of LIBOR L computed. 
L=zeros (NbStep,1) ; 


SFor each time steps, we compute the correspondent value for L 
for i=1:NbStep 

L(i,1)=(P(i,1)/P(i+1,1)-1)/DeltaT; 
end 


sVector of computed sigma values 
Sigma=zeros (NbStep,1) ; 


$For each time steps, we compute the value of sigma at this moment 
for k=2:NbStep 
Sigma (k,1)=(Sigma(k-1,1) * (k-2) *DeltaT+norm (Gamma (k, :),2)^2 
*DeltaT) /...((k-1)*DeltatT) ; 
end 


Vector storing the values of h computed 
h=zeros (NbStep,1) ; 


SFor each time step, we computed the corresponding values of h 
for j=1:NbStep-1 
h(j,1)=(log(L(j+1,1) /Strike)+0.5*Sigma(j+1,1)*(j*DeltaT))/... 
sqrt (Sigma (j+1,1)*(j*DeltaT) ); 
end 


$Vector storing the caplets’ prices at each time t 
Caplet=zeros (NbStep,1) ; 


$For each time, we compute the corresponding caplet’s value 
for 1=2:NbStep 
Caplet (1,1)=DeltaT*P(1,1)*(L(1,1) *normedf (h(1-1,1))-... 
Strike*normcdf (h(1-1,1)-sqrt (Sigma(1,1) * 
(1-1)*DeltaT))); 
end 


%Cap's value from the BGM model is the sum of caplets’ values 
AnsBGM=sum(Caplet,1); 


$For comparison, we present the cap’s price computed 
$from Black’s model. For this, we must specify 
$a sigma, here it is chosen to be near the 
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Saverage value of sigma computed in the BGM model 

CapletBlack=zeros (NbStep,1) ; 

sigma=0.15; 

for m=1:NbStep 
di=(log(L(m,1)/Strike) +sigma*2*DeltaT*m/2) /(sigma*sqrt (m*DeltaT) ) ; 
d2=d1-sigma*sqrt (m*DeltaT) ; 
CapletBlack (m,1)=DeltaT*P(m+1,1)*(L(m,1)*normcdf(d1)-... 

Strike*normcedf (d2)); 
end 


SBlack cap’s price is equal to the sum of caplets’ prices 
AnsBlack=sum(CapletBlack, 1) 


end 


At the end of the program, we add a code that prices the cap using Black’s (1976) risk 
neutral valuation. In that part, we suppose a constant volatility equal to 0.15. Table 11.1 shows 
the results obtained with different strike prices. 


Theorem 4.6 Similarly, it is possible to obtain a floor price with the following formula 


floor(t) = Y Ar P(t, 7) KN ( RET Gy an) 0g =a) 


j=l 
-LIBOR(, Tj- — NN (— h(t, 7-1), (11.116) 
with 
[in (LIBOR(t, T)-1 —1)/K) + $0°(T 1 — | 
oe (11.117) 
7 oj/Tj-1 —t 
and 
2 1 Tj- 2 
o= zz) lys, Tj-1 — 9)|?ds. (11.118) 


Table 11.1 BGM cap prices ($) 


Strike BGM Price Black price 


0.03 0.2248 0.2296 
0.04 0.1860 0.1891 
0.05 0.1475 0.1490 
0.06 0.1104 0.1099 
0.07 0.0763 0.0746 
0.08 0.0493 0.0477 


0.09 0.0310 0.0292 
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Table 11.2 BGM floor prices ($) 


Strike BGM price Black price 


0.06 0.0021 0.0017 
0.07 0.0069 0.0069 
0.08 0.0187 0.0204 
0.09 0.0392 0.0423 
0.10 0.0663 0.0710 
0.11 0.0977 0.1043 


0.12 0.1319 0.1405 


Since the program computing the floor price is almost identical, except for the signs in the 
summation, we only report the results. We also give floor prices evaluated in a risk neutral 
world with constant volatility using Black’s model. Table 11.2 presents the results. 


Pricing a Swaption 


A swaption is an option on an interest rate swap. It gives its owner the right to enter into 
an interest rate swap contract at a specific date in the future. Of course, the owner of a 
swaption is not obliged to exercise it. Recall that a swap contract is a commitment between 
two counterparties to exchange cash flows in the future that will be determined according to a 
predetermined formula. 

We consider a swaption with strike price K and maturity To. If settlement dates are: T; = 
To +jAt, j = 1,...,. Then the price at t < To is 


swaption(t) = At ` PU, T)ER [LIBOR(T, Tj-1 — To) — O EE, |s], 
j=1 


(11.119) 
with 


C;=KAt, j=l,...,.n—-1, C,=1+KAt. (11.120) 


Theorem 4.7 BGM showed that a swaption price at date t < To could be obtained from the 
following approximate formula 


swaption(t) = At a P(t, T;)[LIBOR(, Tj-1 —t)N(—so — dj + Tj) 
j=l 
-KN(—s— dp); (11.121) 
with Sọ computed from 


k 


-1 
n 1 

XC (1 (1 + ArLIBOR(, T;—ı — t) exp(T; (so + di) — D) =1, (11.122) 

k=l 


i—i 
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where 


Cp. =KAt, k=1,...,.n-1, C,=1+KArt, (11.123) 


and 1; and d; are determined by 
To 
Ay = DT; = Y (s, Ti-1 — s)y(s, Tj-1 — sds (11.124) 
t 


and 


I 


AtLIBOR(t, T)_1 — t) 
di = > l>21, d=0. 11.125 
2 1+ ArLIBOR(t, T -4 | ( ) 


l=1 


We present a simple program showing the procedure to follow to find the price of a swap 
contract. We suppose that the values of LIBOR, P, y and At (for the MATLAB program 
spread) are already known. Here are the programs in the case where the strike price is 0.06: 


function AnsBGM=SwaptionBGM 
$Function computing the swaption value from the BGM model 


$Initial parameters 

Strike=0.06; 

NbStep=20; 

DeltaT=0.25; 

SAlready specified gamma function 

gamma = [0; 0.0750; 0.0472; 0.0472; 0.1235; 0.1235; 
0.0962; 0.0962; 0.0403; 0.0403; 0.0951; 0.0951; 
0.0702; 0.0702; 0.0702; 0.0702; 0.0515; 0.0515; 
0.0515; 0.0515]; 


SZero coupon bonds prices vector 

P=[1; 0.9830; 0.9652; 0.9464; 0.9271; 0.9076; 
0.8882; 0.8688; 0.8497; 0.8308; 0.8123; 0.7942; 
0.7764; 0.7596; 0.7435; 0.7274; 0.7112; 0.6954; 
0.6797; 0.6644; 0.6491]; 


$Vector storing the LIBOR rates 
L=zeros (NbStep,1) ; 


SFor each step, we computed the LIBOR rate 

for i=1:NbStep 
L(i,1)=(P(i,1)/P(i+1,1)-1) /DeltaT; 

end 


$Vector storing the values of di 
d=zeros (NbStep,1) ; 
$For each time steps, we compute the value of di 
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for n=1:NbStep 
temp=0; 
for o=1:n 
temp=temp+DeltaT*L(o) /(1+DeltaT*L(o) ) *gamma (o); 
end 
d(n)=temp; 
end 


sVector storing the value of Ck computed 
C=ones (NbStep,1) *Strike*DeltaT; 


S$Only the last of Ck is different 
C(NbStep,1)=1+Strike*DeltaT; 


s=fsolve (@spread, 0.01); 


function ans=spread(s) 
%šFunction used in the computation of spread. We must find 
Sthe value of s that cancels the spread. 


$Variable storing the sum of elements 
temp1=0; 
for k=1:NbStep 
temp2=1; 
for i=1:k 
temp2=temp2* (1+DeltaT*L (i) *exp (gamma (i) 
* (s+d(i))-gamma(i)%*2/2)); 
end 
templ=temp1+C(k) /temp2; 
end 


$Return templ-1 since we want to find s such that templ=1. 
ans=temp1-1; 


end 


S$Vector storing the swaption value at each time step 
Swaption=zeros (NbStep,1) ; 

$Swaption’s price computation 

for p=i;n 


Swaption (p)=P (p+1)*DeltaT* (L(p)*normcdf (-s-d(p)+gamma(p))-... 


Strike*normcdf (-s-d(p))); 
end 
%šThe swap’s value is the sum of swaptions 
AnsBGM=sum(Swaption,1) ; 
end 


With a strike price of 0.06, the BGM model gives us a swap value of $ 0.1082. Table 11.3 


shows the results using different strike prices. 
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Table 11.3 Evaluation of a swap using the BGM model ($) 


Strike BGM price 
0.05 0.1487 
0.06 0.1082 
0.07 0.0678 
0.08 0.0286 


0.09 0.0046 


Notes and Complementary Readings 


For the reader looking for more details on interest rates term structure models and interest rate 
derivatives, the following books would be relevant: Hull (2005), James and Webber (2000), 
McDonald (2003), Neftci (2004), Rebonato (1998, 2002, 2004). 

As stated in the introduction, the term structure of interest rates and interest rate derivatives 
literature can be classified into two categories: the arbitrage-free models and the equilibrium 
models. The arbitrage-free models can be studied in Vasicek (1977), Hull and White (1990), 
Heath, Jarrow and Morton (1992), the market models in Brace, Gatarek and Musiela (1997), 
Jamshidian (1997), Miltersen, Sandmann and Sondermann (1997). Equilibrium models have 
been introduced in the paper by Cox, Ingersoll and Ross (1985a). A theoretical reference for 
affine models is the paper of Duffie and Kan (1996). The first empirical applications of affine 
models have been given by Dai and Singleton (2000) and Duffie and Singleton (1999). 

For an overview of the term structure of interest rates, see Courtadon (1985), Gibson-Asner 
(1987), Chapman and Pearson (2001), Yan (2001). For application papers, see for instance 
Hull and White (2000) for the implementation of Brace, Gatarek and Musiela (1997)’s market 
model, Clewlow and Strickland (1997a) for an interesting application of simulation techniques 
in the valuation of interest rate derivatives. 
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Credit Risk and the Valuation of 
Corporate Securities 


Credit risk results in losses incurred by lenders following a counterparty’s default. It is a 
component of risk from debt and represents one of the major risks that must face most financial 
institutions. There are also many other sources of risk related to the activities and operations of 
a firm or a bank. Add to this the advent of Basle II which is scheduled to be effective in 2007, 
and the quantification and management of credit risk becomes of paramount importance. 
Credit derivatives’ markets have constantly grown at rapid paces in the past few years. In- 
vestors increasingly resort to these financial products, more complex and but also more flexible, 
for the risk management of their activities. Indeed, any investor who wants to eliminate or re- 
duce a risk that he doesn’t want to assume or is unable to control can use derivative products in 
order to transfer this risk to other counterparties which are able to manage or willing to absorb it. 
There are two modern fundamental approaches for the measurement and management of 
credit risk: the structural approach and the reduced-form approach. Merton’s (1973, 1974) 
structural model will allow us to value risky debts issued by a risky firm. This approach 
lays down the foundation of corporate securities valuation and in particular of the capital 
structure in corporate finance studied by Modigliani and Miller (1958). We must also note that 
this approach is the basis of many recent commercial models built for the quantification and 
management of credit risk, such as the popular Moody’s KMV credit risk management model. 
In this chapter, we begin by presenting Merton’s model for risky debt valuation and the 
intuitive properties of the term structure of credit risk. Then we explain how a risky debt can be 
associated with a risk-free debt (issued by a risk-free institution such as a government) minus 
a put option. This put option constitutes the total loss or the risk premium. This analysis of 
credit risk using a put option will be used in the following chapter to illustrate the simulation 
techniques’ application from previous chapters to evaluate the total losses and the financial 
guarantee in the context of a portfolio exposed to default risk from many counterparties. 
We end the chapter with an introduction to the reduced-form approach for the valuation of 
a risky debt. This approach is simple to implement since it only needs an estimation of default 
rates which are being calibrated from market data. However, its weakness emerges from the 
difficulty of studying the interaction properties between counterparties involved in the debt 
contracts. 


12.1 VALUATION OF CORPORATE RISKY DEBTS: 
THE MERTON (1974) MODEL 


The value of any debt issued by a firm is a function of three factors: 


1. The rates offered on risk free debts (government bonds or high quality corporate bonds); 
2. The characteristics and covenants stipulated in the debt contract: maturity, coupon rate, 
sinking fund, debt’s seniority, etc; 


248 Stochastic Simulation and Applications in Finance 


3. The probability that the firm reneges partially or fully on its commitments: this is the 
probability of default. 


These are the three essential factors that determine the cost of debt for the borrower which is 
the interest rate required by the lender. 

While there are many theoretical and empirical studies about the term structure of interest 
rates, Merton (1974) is the first to develop a theory for the valuation of debts with default risk 
using contingent claims analysis. He analytically derived the term structure of credit spreads 
as a function of endogenous factors of the firm (its state variables). These models of credit 
risk analysis are known as structural models. 


12.1.1 The Black and Scholes (1973) Model Revisited 


Under the usual assumptions of a complete and perfect market, the dynamics of the value of a 
firm represented by its total assets A through time follows a stochastic diffusion process given 
by: 


dA(t) = wA(t)dt + o A(t)dW(t), (12.1) 


with u being the instantaneous expected rate of return for the firm, o the instantaneous 
volatility of the firm’s returns, and W a standard Wiener process. 

Let Y = P(A, t) be a financial asset whose market value is a function of the firm value A 
and time t. We can write the dynamics of asset Y in the following differential form: 


dY (t) = uyY (t)dt + oy ¥(t)dWit). (12.2) 


By Ito’s lemma, we obtain 


2p 


dY (t) = SAA) + 5 ree 


oP 
2 
7 9A2 (d A(t) + T (12.3) 


Replacing dA by its expression (1.1) into equation (12.3), we find: 


ars patter + Par + oao awo. (12.4) 


1 2 29 
ayo) =|5¢ AW > 


The equality of the terms in equations (12.1) and (12.4) brings us to the following system of 
equations: 


1 , 23? P oP OP 
uyY = = oA 7 + uA + 
2° ss ðA 0A ot f (12.5) 
oyY = o A— 
ðA 


We now consider a portfolio V, composed with two assets: the firm’s asset and asset Y. The 
portfolio is composed of a long position in asset Y and a short position of the firm’s asset with 


ap 
proportion 57. 


oP 
V(t)=Y(t)—- aa: (12.6) 
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The proportions invested in each asset are such that any stochastic component is removed from 
the portfolio V: 


avi) = av) — naa) 
> ðA 
(2 + $2607?) 


= vO V(t)dt. (12.7) 


This is equivalent of saying that portfolio V’s variation is deterministic. Then, in the absence 
of arbitrage opportunities, the return on the portfolio is equal to the risk-free rate r: 


dV (t) = V(t)rdt. (12.8) 


Comparing equations (12.7) and (12.8), we have the following partial differential equation: 


J42 TrA g + a rP =0. (12.9) 
This relation represents the well known partial differential equation of Black-Scholes-Merton. 
This is a parabolic partial differential equation satisfied by all asset Y = P(A, t) such that the 
value at each time is a function of A and t. 

In order to determine the expression of P(A, t), we must specify the boundary constraints of 
the problem as well as an initial condition. These constraints allow us to differentiate between 
the expression of a firm’s debt and its equity, for example. 

As we have seen in Chapter 9, note the absence of the firm’s expected rate of return or any 
expression related to investors’ preferences and expectations. In an arbitrage-free framework, 
all investors assign the same price to a claim on an asset, given that they agree on the asset’s 
value and volatility. 


1 ə? P oP aP 
57 


12.1.2 Application of the Model to the Valuation of a Risky Debt 


Merton (1974) adapted the Black-Scholes-Merton (1973) model described above for the 
valuation of a debt security. 

Under the standard assumptions of a complete and perfect market, we consider a firm 
financed by a unique and homogeneous class of debt and with stocks. The debt consists of 
issuing a zero-coupon bond with maturity T and face value F. 

In the case of a partial or total default, debtors seize all the firm’s assets. The firm’s 
shareholders will only recover the residual value of the assets after the debt has been paid. 

The firm cannot issue new senior debt nor can it issue a debt of the same rank as the actual 


debt and pay dividends as long as the debt has not reached maturity. 
ðP ap 


Consider P(A, t) with t = T — t the time to maturity, since [- = —4,, the debt value 
which is the contingent claim must satisfy equation (12.9): 
tty ,38°P dP oP 
+rA eP = 0. 12.10 
easy Callas y Vale o Ce) 


Merton (1974) showed that in the absence of corporate taxes and bankruptcy cost, Modigliani 
and Miller (1958)’s theorem still holds for a risky firm whose default probability before 
maturity is positive. 
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If we define S(A, t) as the shareholders’ value at any time t, referred to as the firm’s equity, 
we can express the whole value of the firm as the sum of the shareholders’ value and the debt 
value: 


A(t) = P(A, t)+ S(A, t), (12.11) 


with the constraints P(O, t) = S(O, t) = 0, and P(A, t) < A, Vt and t = T — t. Condition 
P(O, t) = S(O, t) = 0 is a condition assuring that P and S take positive values and condition 
P(A, t) <A is another constraint ensuring that the debt’s value at all time is smaller that the 
firm’s total value. 

At the maturity of the debt, if the firm’s total value A(T) is higher than F, the debt will be 
fully reimbursed. In the other case, debtors will only recover the firm’s value A(T). Then the 
debt’s value at maturity is 


P(A, 0) = min(A, F). (12.12) 


In order to get an expression for the debt P(A, t) at date t, we first determine the value of the 
firm’s equity that we associate with a call on the firm’s value as in the Black-Scholes (1973) 
model. 
Equations (12.10) and (12.11) bring us to the following equity valuation equation: 
1 4.5078 as. as 
A rA "S = 0, 12.13 
eRe Ra Via (eR 


with boundary conditions: 


S(A,T) = A — P(A, 0) 


= max(A — F, 0), 
S(O, t) = 0, 
S(A,t) <A. 


We find the expression of a European call option written on the firm’s value A held by 
shareholders and whose strike price is F. The counterparty for this contingent claim is the 
debtor. Black and Scholes’s (1973) formula gives us the following expression for the firm’s 
equity: 


S(A, t) = AN (x1) — Fe" N(x, — 0 VT — 1), (12.14) 
where 
_ In(A/F)+¢ +07/2)(7 — t) 
x)= A (12.15) 
and 
N@E = I e- 2? dz (12.16) 


is the standard normal cumulative distribution function. 
Since P(A, T — t) =A — S(A, t), we can show from equation (12.14) that the debt’s value 
is: 


P(A,T —t) = Fe’? N(x, — oVT — t) + AN(—X}). (12.17) 
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After algebraic manipulations, we get the following form: 


1 
P(A,T — f) = Fe NG, oVT —1) + = x)|; (12.18) 
with 
Fe~ T-) 
qois (12.19) 
A 


also called quasi-debt ratio. 
The expected loss EL is given by: 


EL = Fe"? — P(A,T —1t) 


SS 1 N(—x1) 
= N(—x1 +oVT =t) x Fe"? x [1 

ce ) d* N(—x, +o JT —t) 
From this expression we find the widely used expressions in credit risk models, such as 
KMV-Moodys’, as well as in recent capital regulations of Basle. These expressions are: the 


probability of default PD equal to 
PD = N(—x, +ovT — t), (12.21) 
the exposure at default EAD given by 


|: (12.20) 


EAD = Fe", (12.22) 


and finally the loss given default LGD which is equal to 


LGD = [! : NES) | (12.23) 
> d* N(—x, to VT —1t)1 
In this expression for LGD, the term 
1 N(- 
(=a) (12.24) 


d* N(—x, +oJ/T —t) 
represents the recovery rate. As we can see, the recovery rate depends on the assets’ value, 
the debt ratio and the asset returns’ volatility. We can show that the recovery rate is correlated 
with the firm’s probability of default. 

Since the debt’s value can also be written in terms of a risk premium, we set: 


P(A,T —t) = Fe RODE, (12.25) 


Then R(t, T) is called the yield to maturity and the expression R(t, T) — r is the default risk 
premium. The credit spread’s term structure, R(t, T) — r, is defined by the following equation: 


1 1 
RU.T)-r = —2 In (NG; oVT EN x). (12.26) 


Si 
Thus, Merton (1974) valued a risky debt and its credit risk premium. At each time f, we 
define N(x; — o./T — t) as being the probability that at maturity the debt will be totally 
reimbursed and N(—x; +0/T —t)= 1 — N(x; —oVT — t) as the probability of default. 
These probabilities are expressed in a risk neutral world. 

As we can see in Figures 12.1 and 12.2, at each time ¢, and for a given maturity, the risk 
premium or the credit spread is a function of two variables: the assets’ volatility and the ratio 
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Figure 12.1 Evolution of the risk premium or credit spread as a function of the quasi-debt ratio 
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Figure 12.2 Evolution of the risk premium or credit spread as a function of the assets’ volatility 


of the debt’s present value (discounted at the risk free rate) to the firm’s value, the quasi-debt 


ratio. 


These figures can be built in calling the MATLAB® function below with a vector d* of 
quasi-debt ratios (for Figure 12.1) and a vector o of the firm’s asset volatilities (for Figure 


12.2) as its parameters. 


function RiskPremium=RiskPremium(tau,d,A,sigma,r) 
$Function computing the risk premium for a risky bond 
$Tau: Time to maturity. 

%d: Quasi-debt ratio. 

SA: Initial value of the firm’s assets. 

$sigma: Volatility of the firm's assets. 

Sr: Risk-free interest rate. 


$Computing debt with given parameters. 
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F = d.*A.*exp(r*tau) ; 


$Computation of parameter x1. 
x1 = (log(A./F)+(r+sigma.*2/2) .*tau) ./(sigma.*sqrt (tau) ) ; 


$Computation of risk premium. 
RiskPremium = -(1./tau) .*log(normedf (x1-sigma.*sqrt (tau) ) + 
(1./d) .*normedf (-x1) ) ; 


end 


We see that the risk premium or credit spread required by lenders is an increasing function 
of the quasi-debt ratio and the volatility of the firm’s value. The assets’ volatility stems both 
from systematic and non-systematic factors. 


12.1.3 Analysis of the Debt Risk 


Merton (1974) was interested in the analysis of debt’s risk relative to the firm’s global risk. 
The variable v measures this relative risk: 


vad, as ee 
o 


(12.27) 


where oy represents the debt’s volatility and o the volatility of firm’s assets. From equation 
(12.5), we get: 


dP/IA | N(-x1) 

P. N(=x1) + d*N(@q —oJVT —1) 
Since 0 < N(.) < 1 and 0 < d, we see that 0 < v < 1. A firm’s debt can never be more risky 
(volatile) than the firm itself. Figures 12.3 and 12.4 show the variations of debt’s risk level as 


a function of the firm’s volatility and quasi-debt ratio. These figures have been plotted using 
the following MATLAB program. 


v(d*,T —th=A (12.28) 


T-t =1 
— — — T-t =3 


oy: Debt’s volatility 
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Figure 12.3 Debt volatility as a function of the quasi-debt ratio 
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Figure 12.4 Debt volatility as a function of the firm’s assets’ volatility 


function VolDebt = VolatilityDebt (tau,d,A,sigma,r) 
$Function computing the risky debt volatility 
$Tau: Time before maturity. 

%d: Quasi-debt ratio. 

SA: Firm's assets initial value. 

$sigma: Firm’s assets volatility. 

Sr: Risk-free interest rate. 


sDetermination of the debt value from initial parameters. 
F = d.*A.*exp(r*tau) ; 


sComputation of parameter x1. 
x1 = (log(A./F)+(r+sigma.*2/2) .*tau) ./(sigma.*sqrt (tau) ) ; 


$Computation of debt volatility. 
VolDebt = sigma. *normcdf (-x1) ./ (norm(-x1) + 
d.*normcedf (x1l-sigma.*sqrt (tau) )); 


end 


When the quasi-debt ratio becomes very large, that is, when the ratio of the present value 
of the promised payment F relative to the firm’s value becomes very large, we see that the 
risky debt’s price P goes to the firm’s value. In that case, the firm’s probability of default at 
maturity goes to one. Similarly, the debt risk oy goes to its superior limit which is the global 
risk of the firm. Thus we have the following limit conditions 


lim P(A,T—t)=A_ and lim v(d*,T —t)=1. (12.29) 


d*—> +00 d*—> +00 


When the quasi-debt ratio goes to its inferior limit of 0, the present value of the future 
payment is very small compared to the firm’s market value. Then, the firm is almost certain to 
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Figure 12.5 Credit spread as a function of the debt maturity 
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Figure 12.6 Debt’s volatility as a function of the debt maturity 


honor its payments to its lenders. Thus we get the following limit conditions 


lim P(A, T —t)=Fe"? and lim v(d*,T —t)=0. (12.30) 

Figures 12.5 and 12.6 show the credit spread and the debt’s volatility as a function of time 
to maturity for different debt ratio levels. To generate these graphs, we use the following 
program: 


function Graphs 
$Function plotting the credit spread and the debt’s volatility 


šas a function of different factors. 


$ta is the vector of time steps where we want to 
$compute the spread and the volatility. 
ta=0...12 1333 
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SInitial parameters. 
do=0.8; 

sigma=0.2; 

A=1; 

r=0.05; 


$Computation of the spread and volatility with d=0.8. 
Rmro=RiskPremium(ta,d0,A,sigma,r) ; 
SigmaYo=VolatilityDebt (ta,d0,A,sigma,r) ; 


SComputation of the spread and volatility with d=1.0. 
do=1; 

Rmr=RiskPremium(ta,d0,A,sigma,r) ; 
SigmaY=VolatilityDebt (ta,d0,A,sigma,r) ; 


$Computation of the spread and volatility with d=1.2. 
do=1.2; 

Rmrs=RiskPremium(ta,d0,A,sigma,r) ; 
SigmaYs=VolatilityDebt (ta,d0,A,sigma,r) ; 


Then we plot the graphs. 

figure 

hold on 

plot (ta,Rmrs,’k’,ta,Rmr,’k--'’,ta,Rmro,’k*’); 
xlabel(’Time to maturity’); 

ylabel(’R-r: Spread’) ; 

legend(’d>1’', ‘d=1','d<1'); 

hold off 


figure hold on 

plot (ta,SigmaYs,’k’,ta,SigmaY,’k--'’,ta,SigmaYo,’k*’) ; 
ylabel(’\sigma_y: debt’s volatility’); 

xlabel(’Time to maturity’); 

legend (’d>1’',’d=1','d<1'); 

hold off 


end 


12.1.4 Relation Between The Firm’s Asset Volatility and Its Equity Volatility 


When computing risky debt value, we considered a contingent claim having the firm’s value as 
underlying. This implies that the firm’s volatility is an important variable for the valuation of 
different corporate securities. Unfortunately, since most or all of a firm’s assets are generally 
not traded, it is difficult to directly determine the volatility of the firm’s total assets. 
However, from market data it is possible to determine the firm’s equity volatility if its shares 
are traded on a stock exchange. In establishing the relation between the equity value and the 
call option, Ito’s lemma allows us to get a system of equations linking the market value and 
volatility of equity to the market value and volatility of the firm’s total assets that cannot be 
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directly observed. Then, from the equity market’s data of the traded firm, we can determine 
the volatility and the market value of the firm’s total assets. 

As we previously demonstrated, the firm’s equity can be seen as a call option owned by 
shareholders on the firm’s total assets with the face value of the debt as its strike price. If we 
respectively note A and ø 4 as the market value and the volatility of the firm’s total assets, S 
and os; the market value and the volatility of the firm’s equity, and F the debt’s face value, 
then the equity value can be expressed as the price of a call option as follows: 


S = AN(x) — Fe™ N(x — o44/7), (12.31) 
where 
In(A/F : 22 
ga WAE tawe (12.32) 
OAs/T 
The two volatilities can be linked by the following relation: 
A ðS 
= —— 04, 12.33 
os S aA04 ( ) 
which implies 
A 
Os = qN), (12.34) 
or 
osS 
= 5 12.35 
OA ANG) ( ) 


In conclusion, if we know S and o's, using equations (12.31) and (12.35) simultaneously allows 
us to estimate the values of A and o4. 

To find the values for A and o, we use MATLAB’s function fsolve. This function is 
implemented to find the roots of a function. It allows us to find a value for y such that 


fy) =09. 


For our problem, we begin by building function VolatilityAssets(y). 


function VolAssets = VolatilityAssets(y) 

$Function used in solving the non linear equation to 
Sestimate the volatility and firm's assets value. 
y(1,1): Assets’ value. 

šy (2,1): Assets’ volatility. 


SExtraction to input parameters 
A = y(1,1); 
sigmaA = y(2,1); 


$Initial parameters 


S = 100; 
sigmaS = 0.3; 
r= ).0%/035 

F = 60; 


tau = 1; 
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sComputation of parameters x. 
x = (log(A/F)+(r+sigmaA*2/2)*tau) / (sigmaA*sqrt (tau) ) ; 


sDetermination of outputs. 


VolAssets(1,1) = S-A*normcdf (x) +F*exp (-r*tau) * 
normcdf (x-sigmaA*sqrt (tau) ) ; 

VolAssets(2,1) = sigmaA-sigmaS*S/ (A*normcdf (x) ) ; 

end 


Now we use function fsolve: 
sThese lines solve simultaneously the two equations. 


$Initial value for y0. 
yo = [100;0.3]; 


sSetting the options for function fsolve. 
options = optimset(’Display’,’‘iter’); 


$Call of function fsolve to solve the equations. 
[y] = fsolve(@VolatilityAssets,y0,options) ; 


Using MATLAB’s options function in this manner, we can see the number of iterations 
performed by function fsolve and the value of the variable VolatilityAssets at each iteration. 
With the values used in the program, we find: 


A = 158.2267 
oa = 0.1896. 


12.2 INSURING DEBT AGAINST DEFAULT RISK 
12.2.1 Isomorphism Between a Put Option and a Financial Guarantee 


Because issuing a financial guarantee generates a liability or a cost for the guarantor, Merton 
(1977) derived a formula to value that cost. 

Merton (1977) established an isomorphism between the financial guarantee on a loan and 
a put option in order to apply the Black-Scholes-Merton (1973) option pricing formula to the 
valuation of financial guarantees. 

We consider a European put option written on an underlying asset of value S, with strike 
price K and maturity T. If at maturity the asset value is smaller than K, (S(T) < K), then the 
option holder exercises the option, and if the value is larger, (S(T) >K), the holder doesn’t 
exercise it and will let it expire. In the first case, the option value is K — S(T), and in the 
second case it is zero. The option’s value at maturity date T is then equal to the maximum of 
zero and the difference K — S(T): 


max(K — S(T), 0), (12.36) 
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and its price at each time is determined by the Black and Scholes (1973) formula if the asset’s 
value follows a geometric Brownian motion. 

Now we consider a firm borrowing an amount F by issuing bonds. The firm promises to 
pay the amount F at the maturity date T. 

We suppose that the dynamic of the firm’s assets A follows a geometric Brownian motion: 


dA(t) = wA(t)dt + o A(t)dW (t), (12.37) 


with u the expected instantaneous return on the firm’s assets, o the instantaneous volatility of 
returns on the firm’s assets, and W a standard Wiener process. 

If at maturity date T the firm’s value A(T) is superior to F, (A(T) >F), then the bond’s 
owner will receive F as promised. However, if the firm’s value is inferior to F, (A(T) < F), the 
firm defaults and the bond’s owner seizes the firm’s assets; thus the bond’s owner will only 
receive A(T). The debt value D(T) (or equivalently P(A, 0) in equation (12.12)) can therefore 
be written as 


D(T) = min(A(T), F) (12.38) 
and the value of equity is 
max(A(T) — F, 0). (12.39) 


If the probability of event {A(T) — F < 0} is greater than zero, then there is a risk of default, 
that is, the debt having face value F is risky. 

Now suppose that a third counterparty guarantees the loss without default, for example, a 
government endorser. At the maturity of the debt, if A(T) < F, the value of the debt redeemed 
will be F with the difference F — A(T) being paid by this third counterparty. However, if 
A(T) > F, the firm doesn’t need the help of the third counterparty and the contribution of the 
endorser will therefore be zero. Finally, the amount paid by the endorser will be equal to the 
maximum of zero and the difference F — A(T): 


G(T) = max(F — A(T), 0). (12.40) 


We can then establish an identity between the financial guarantee and the put option, 
supposing that F is equal to the strike price K, and A is equal to the value of asset $. When 
guaranteeing the debt, the endorser is actually issuing a put option that is owned by the firm’s 
shareholders. This option gives the firm the right to sell its assets to the endorser for an amount 
F at the debt’s maturity date T. 

If A(T) >F, the firm will not exercise its right and the financial guarantee will be zero; 
however, if A(T) < F, it will exercise its right and the financial guarantee’s value will be F — 
A(T). From the isomorphism between the put option and the financial guarantee, the value of 
the guarantee is determined by the Black and Scholes (1973) formula. 

The value of the guarantee G at time t is given by: 


G(t) = Fe’? ON (—x1 to VT —t)— A(t)N(—x}), (12.41) 
where 


ve PAO/P+ E+ ENT —t) 
LS ey 


ET (12.42) 
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with r the constant risk-free interest rate, F the bond’s face value representing the amount of 
debt, A the value of the firm’s assets, N(.) the Gaussian cumulative distribution function, and 
o the volatility of the returns on the firm’s assets. 


12.2.2 Insuring The Default Risk of a Risky Debt 


Following Merton’s (1974, 1977) work, Babbel (1989) qualitatively sketched an insurance 
strategy for a bank. 
The proposed strategy for the bank is to buy a put option on the firm’s assets with an exercise 
price F corresponding to the debt’s face value and a maturity T, the same as the debt’s maturity. 
Equation (12.17) giving the risky debt’s value reported below, 


P(A, t) = Fe" N(x, — ot) + AN(—x1), (12.43) 
is modified and rewritten under the following form: 


P(A, t) = Fe" —[Fe"'N(—x, + 0/t) — AN(—x})] 
= Fe" — Put(A, rT), (12.44) 


with 


—_ In(A/F) + (7 +.07/2)t 
X1 = oJt r 


This formulation of the debt’s value shows that it is equivalent to an identical risk-free debt 
minus the put option’s value Put(A, T). 

Since equities are a call option on the firm’s assets, the debt is represented by the difference 
between the firm’s value and the value of its equity. Figure 12.7 shows the bank’s payoff 
function when it takes a position in the previous put option. 

From the bank’s point of view, owning the firm’s risky debt corresponds to having a long 
position in a risk-free asset and a short position in the put option. 


(12.45) 
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Figure 12.7 Debt, equity and assets’ payoffs 


Credit Risk and the Valuation of Corporate Securities 261 


To hedge its risky debts portfolio — that is, to eliminate the risky component corresponding 
to the short position in the put option — the bank must close its position in the put. It must 
therefore enter into a long position in an identical put option to hedge its portfolio. 

By buying this option, or by synthetically creating it, the bank buys an insurance policy on 
the risky debt, which guarantees the recovery of the face value at maturity, regardless of the 
fate of the firm. 

At maturity T, the put option’s value will be 


Put(A, 0) = max(0, F — A), (12.46) 


added to the risky debt’s value P(A, 0) = min(A, F), the bank is assured that the combination 
of these two values (risky debt portfolio and put option) gives F: 


P(A, 0) + Put(A, 0) = min(A, F) + max(0, F — A) = F. (12.47) 


Because the final payoff of the portfolio composed of long positions in the risky debt and 
the insured put option is a sure payoff, that is there is no risky component. In an arbitrage-free 
world, the expected return of the debt asset net of the insurance strategy cost will not be larger 
than the risk-free rate. 

If the bank seeks to make a return superior to the risk-free rate, it must not totally eliminate 
the risky component of its debt. Its insurance strategy must be adjusted to keep only the risk’s 
portion that it is ready to bear and realize the consequent expected return. 

One way to achieve this is to adjust the put option’s strike price in function of the floor 
value of its insurance strategy. Since the new strike price is lower, the bank is only partially 
hedged and the price of its new put option is inferior to the price of the total insurance option. 

To illustrate this idea with an example, we consider the case where the bank wants to insure 
the recovery of at least a fraction a of the debt. The floor value chosen by the bank would 
therefore be œF. To achieve that goal, it must get a put option with a strike price of gF. 

With this strategy, the bank would be ready to take a maximal loss of (1 — @)F.. Figure 12.8 
illustrates this partial insurance strategy. We used the following parameters: F = $200 and 
a = 0.75. 
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Figure 12.8 Illustration of a loan guarantee 
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12.2.3 Establishing a Lower Bound for the Price of the Insurance Strategy 


In the context of a portfolio of loans, which will be covered in more detail in Chapter 13, 
suppose that a bank is interested in synthetically replicating a put option on the whole portfolio 
instead of creating a portfolio of put options for each debt. The reason for this choice is that 
the strategy in the first case costs less for the same hedge level. 

Moreover, the efficiency of the synthetic put option depends on the underlying asset or 
portfolio of assets of this option. Also, the strike price (the floor value) is very important for 
the effectiveness of this strategy. 

Many aspects must be considered to determine the fraction œ to cover. First, we must 
recognize the possibility that many firms default because of specific conditions rather than 
systematic risk. This type of risk is on the one hand diversifiable and on the other hand 
recovered with the risk premium requested from each borrower. Thus, integrating this risk 
factor in the hedging strategy with the put option would be redundant and would constitute an 
additional cost for the bank. To avoid this situation, the lender must fix the floor value below 
the total face value of the debt F. 

Recall that when the bank increases the number of loans to independent borrowers (not 
perfectly correlated), it can predict with more confidence the loss rate on the loans and absorb 
default cases due to micro-factors, and that is because of the diversification effect. 

On the other hand, when a variation of economic factors brings many simultaneous defaults 
by corporations, there will always be some firms that will be only slightly hit by these factors 
and honor their contracts. 

Even if the systematic risk factor affects all firms, its impact will still be unique to each 
firm since the degrees of market risk exposition are different. Thus, it is important that the put 
option’s strike price takes this into consideration. 

Babbel (1989) determined that the chosen fraction a for decreasing the option’s floor value 
will depend on parameters such as the financial leverage and the borrower’s activities. The 
optimal œ parameter can be found using actuarial treatment based on important economic 
criteria. 

In the insurance portfolio context, Leland (1980) and Brennan and Solanki (1981) showed 
that, generally, the optimal insurance strategy would be to hold a portfolio of options, written 
on the same asset but with strike prices fixed at different levels œ;F (where 0 <a; < 1 for i = 
1,..., n, and n is the number of options). 


12.3 VALUATION OF A RISKY DEBT: 
THE REDUCED-FORM APPROACH 


The reduced-form approach has been initially proposed by Jarrow, Lando and Turnbull (1997) 
and later extended by Duffie and Singleton (1999). We present the works of Duffie and 
Singleton (1999) in this family of reduced-form approaches. 


12.3.1 The Discrete Case with a Zero-Coupon Bond 


Before presenting the result in the general case, for the sake of intuition we present the discrete 
case with a zero-coupon bond. Our goal is to show that in a risk neutral context, the price of a 
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zero coupon bond paying X at maturity is given by 


T 
P(t,T)=E2 [exp (-/ Riu, Tau) x| , (12.48) 


where R(t, T) = r(t) + h(HL(), with r(t) the short-term risk-free interest rate, A(t) the hazard 
rate for default at date t and L(t) the expected loss on market value in case of default at date t. 
Because there is a default probability, quantity h x L is qualified as the risk-neutral-mean loss 
rate. E kii . ] is the expectation under risk neutral probability Q. 

We look at a bond whose issuer promises to pay an amount X at maturity without intermediate 
payments before maturity. In this case, we can write the bond’s price the following way: 


P, T) = he OEL lot + D] +- Ae TO ELLP +1, T), (12.49) 


where A(t) is the default probability, g(¢ + 1) is the bond value after default, and P(t + 
1, T) is the bond value if there is no default. This equation means that the bond’s price is equal 
to the discounted value of payments received between ¢t and t + 1 in case of default plus the 
discounted value of price P(t + 1, T), in the absence of default. 


Suppose that 
EL iot +) =U — L()ELIP (t + 1,T)], (12.50) 
then 
P, T) = AOA — LO) + — A) Je" EL [Pt + 1, T). (12.51) 
And since 
P(t, T) =e ODEL [P+1,T)], (12.52) 
then R(t, T) is defined as follows 
e FED = AA — Lite" + (1 —hitye™. (12.53) 
Which implies the following expression for the credit spread 
RE, T) =r) = — 1n (AOA — L@) +A = AA) = —In(1 — AOLA), (12.54) 
or 
R,T) ~r(t) + AOLE). (12.55) 


12.3.2 General Case in Continuous Time 


Now we consider the general case — that is in continuous time and with the possibility 
of intermediate payments. For this approach, the default event is generally a jump process 
modeled by a process H(t) = 1y>7;. The first date when a jump occurs is t. The random 
variable H(t) takes value 1 if there is a default and zero in the opposite case. We can decompose 
the expression for H using H = I + M, with 


TAt t 
r= f h(u)du =f A(u)lwerndu, t>0 (12.56) 
0 0 


and M is a martingale. Variable h is interpreted as being the hazard rate under the risk neutral 
probability. 
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We look at a bond with a possibility of default. Under the risk neutral probability, the bond’s 
price can be written as: 


P(t,T)=E2 f exp (- e randu ) dC(t A »] , (12.57) 


or equivalently 
T s 
P(t,T) = E2 p exp (-f R(u, Tau) acs) : (12.58) 
t t 


with C(t) representing the cumulative coupons payments, t the time of default, and T A s = 
min(t, s). The credit spread is the difference R(t, T) — r(t). 

If we suppose that in the case of a default at date t, the amount paid X’ using the asset is 
proportional to its price just before default with a proportionality factor 1 — L(t): 


X’=(1-L(t))P(@_,T) with P(_,T)= lim P(s, T), (12.59) 


to have an equality between equations (12.57) and (12.58) for all t < t, then it is possible to 
show that the following relation must be satisfied: 


R,T) =rQO+AOLo, (12.60) 
Sa 
spread 


with A the hazard rate and L the expected loss proportion as a percentage of the asset’s market 
value. 


Notes and Complementary Readings 


Complementary references on this chapter’s subjects are: Das (2005), Duffie and Singleton 
(2003), Lando (2004), Schonbucher (2003). 

Generally, this large literature can be classified into two categories: structural approach and 
reduced-form approach. For the structural approach, seminal papers are Black and Scholes 
(1973), Merton (1973, 1974, 1977), Black and Cox (1976) and many others. For the reduced- 
form approach, basic papers are Duffie and Singleton (1999), Jarrow, Lando and Turnbull 
(1997) and many others. 
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Valuation of Portfolios of 


Financial Guarantees 


Nowadays, credit risk management is a very important and challenging task for all financial 
institutions. Many institutions manage portfolios composed of many financial assets. To assess, 
manage and control risks, risk managers need to quantify their portfolio’s risk level. For the 
purpose of credit risk assessment, in the previous chapter we have presented the structural 
model for the valuation of risky debts. 

Financial guarantees are financial instruments which facilitate the management of risks 
linked to credit portfolios held by institutions. In this chapter, we propose two valuation cases 
of financial guarantees and credit insurance portfolios using the simulation techniques studied 
in previous chapters. The first application case presents a portfolio of loan guarantees held by 
a vulnerable guarantor. In the second case, we again analyze a portfolio of credit insurances 
but this time using stochastic interest rates and asset volatilities. 

These application cases are practice exercises, in the credit risk context, for the simulation 
techniques covered in this book. For each application case, we provide the MATLAB 
program. 


13.1 VALUATION OF A PORTFOLIO OF LOAN GUARANTEES 


This section is an adaptation of the work by Gendron, Lai and Soumaré (2002). The goal 
of this study is to use contingent claims analysis (CCA) or option pricing theory to evaluate 
portfolios of vulnerable private loan guarantees. 

To analyze private loan guarantee portfolios, we first consider the value of single guarantees 
that we aggregate into portfolios. Merton (1977) has established the isomorphism between a 
single default-free guarantee and a put option. Since we are dealing with private guarantees, 
this put becomes a complex option on both the value of the guaranteed firm and the guarantor. 
Under the standard Merton (1974) framework, we first describe the dynamics of the firm and 
the guarantor, then evaluate the loss per unit of face value, and finally calculate the value of 
the private loan guarantee. 

We use the standard CCA assumptions (e.g. perfect markets, frictionless markets, no 
taxes, no transaction costs, no asymmetric information, continuous trading, etc.). So, there 
is no moral hazard (hidden action) and no adverse selection (hidden information). We sup- 
pose no violation of the absolute priority rule. When there is violation of the absolute 
priority rule in bankruptcy situations, senior claimants concede to junior claimants (e.g., 
manager and shareholders) part of the assets they are entitled to. We also ignore poten- 
tial agency problems inherent in financial contracting resulting from the conflict of inter- 
est between shareholders and debtholders. For each firm, we also assume a simple capi- 
tal structure consisting of only one zero-coupon bond and equity under a constant interest 
rate regime. 
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The insured portfolio consists of N guaranteed firms (N > 2). There are no cash outflows 
from the firms (e.g. dividends) before the bonds mature. All the bonds mature at the same time 
and we assume no other senior debt. 


13.1.1 Firms’ and Guarantor’s Dynamics 


As in Merton (1974), the firm’s asset follows a continuous diffusion process with constant 
instantaneous mean return and volatility. Since we have N borrowing firms correlated, the 
dynamics of the firms’ asset values {A;, i = 1,..., N} are given by the following stochastic 
differential equations: 


dA, (t) = wjAj(tHdt +o;,A;\HdW,(t), i=1,...,N (13.1) 


where A; is firm i’s total asset value, u; is the instantaneous mean of firm i’s assets returns 
and g; is the instantaneous volatility of firm i’s asset returns. {W;, i = 1,...,N} are standard 
Wiener processes correlated as follows 


pijdt = E[dWi;dW,), Vi, j=1,...,N. (13.2) 
Or under the risk-neutral probability, 
dAj(t) =r A,(t)dt + o A(t)dW,(t), i=1,...,N (13.3) 


where r is the constant risk-free interest rate and { W; i = 1,..., N} are standard Wiener 
processes under the risk-neutral probability. 

The firms borrow money by issuing zero-coupon bonds with the same maturities and face 
values {F;, i = 1,..., N}. We assume the existence of a single guarantor. 

The guarantor’s total asset is also assumed to follow a continuous diffusion process. Let 
Ag denote the value of the guarantor’s total assets. The dynamics of the guarantor are then 
described by the following process: 


dAG(t) = ug Ac(t)dt + og Ac(t)dWc(t), (13.4) 
or under the risk-neutral probability, 
dAG(t) = rAg(t)dt + oc Ac(t)dWe(t), (13.5) 


where Ag is the value of the guarantor’s assets available for the guarantee, ug is the in- 
stantaneous mean of the guarantor’s assets returns, and og is the instantaneous volatility of 
the guarantor’s assets returns. Wg is a standard Wiener process and its correlations with the 
sources of risk of the firms’ returns are 


picdt = E[dW;dWg], VWi=1,...,N. (13.6) 


Wo is a standard Wiener process under the risk-neutral probability. 

The above equations describe the dynamics and the interaction of each borrowing firm with 
the other firms and the guarantor. A firm is influenced not only by its own characteristics (i.e. 
the instantaneous asset returns’ mean and volatility), but also by the dynamics of the other 
firms and the guarantor. 
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13.1.2 Value of Loss Per Unit of Debt 


The loss per unit of debt measures the average portfolio risk level. It represents the total 
portfolio risk to be covered by the guarantor. The value of loss per unit of debt, L;(T), to the 
debtholders of firm i at maturity T is given by: 
max(0, F; — A;(T)) 
L,(T) = , (13.7) 
Fi 

where A;(T) is firm i’s asset value at maturity T and F; is the face value of the debt issued by 
firm 7. The numerator of this equation denotes the payoff at maturity of a put on the underlying 
asset A;, with exercise price F;. Rewriting the equation as max(0, 1 — A,(T)/F;), it is clear that 
the loss is a function of the Merton’s (1974) pseudo debt-asset ratio F;/A;. 

At maturity T, the value of loss per unit of debt L(7) for the portfolio is the weighted average 
of L i (T): 


N 
LT) = Ja; x Li(T), (13.8) 
i=1 
where œ; is the portfolio weight for firm i and is defined as the relative contribution of firm i 
to the total portfolio debt value, i.e., 


F; 
a; = ——_.. (13.9) 


4 Fk 


Using the equivalent martingale argument via the expectation E2[.], the present value of 
the portfolio loss per unit of debt is the expected value of loss per unit at maturity discounted 
at the constant risk-free interest rate r, or: 


L = N 


Å T- „ m0, Fi — Am) 
Bele 2 a 1 Ti 2 


N 


4 = gef rT- ppan FSA; |. (13.10) 
i=1 ae 1 Fk 


Note the similarity between the portfolio loss per unit L(t) and the loss given default LGD 
seen in section 12.1.2. 


MATLAB program 


function rep=MeanPremium 
$Function computing the average insurance premium in the case 
sof a guarantor and three firms. 


$Initial parameters 
Tst; 

NbStep=52; 
NbTraj=100000; 
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r=0.05; 

mu=r; SInstantaneous rate of return for 
$firms under risk-neutral probability 

DeltaT=T/NbStep; 


SInitial values for the firms 
Al=30«ones (NbTraj,1); 
A2=40x«ones (NbTraj, 1); 
A3=50x0nes (NbTraj,1); 


$Initial value of guarantor 
AG=80«ones (NbTraj,1) ; 


sCovariance matrix construction 

Rho12=0.1; 

Rho13=0.5; 

Rho23=-0.3; 

sigmal=0.2; 

sigma2=0.3; 

sigma3=0.5; 

sigmaG=0.25; 

MatrixCov=DeltaTs [1,Rho1l2,Rho13,0;Rho12,1,Rho023,0;Rho13,Rho23,1,0; 
eee 0370540541] 2 

MatrixL=chol (MatrixCov) '; 

SFace values of the firms’ debts 

Debt1=20; 

Debt2=30; 

Debt 3=30; 

SVector of theoretical means of the Brownian motions. 
MatrixMean=zeros (4,1); 


$For each step, we simulate the firms’s prices and the guarantor’s 
value. 
for i=1:NbStep 


%Use of the quadratic resampling technique. 
Temp=MatrixLsrandn(4,NbTraj) ; 
dW=ReQuadrac (Temp, MatrixMean,MatrixCov) ’; 


A1l=A1. x+ (1+muxDeltaTsones (NbTraj,1)+sigmal+dWw ( ) 
A2=A2 . x (1+musDeltaTsones (NbTraj,1)+sigma2+dWw ( ) 
A3=A3.«* (1+muxDeltaTsones (NbTraj,1)+sigma3+dW(:,3)); 
AG=AG. x (1+muxDeltaTsones (NbTraj,1)+sigmaG+dWw ( ) 


end 


$Sum of losses divided by the total of debts. 
LossPayOff= (max (0,Debt1-Al1) +max (0,Debt2-A2) +max(0,Debt3-A3) )/ 
(Debt1+Debt2+Debt3) ; 
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Loss per unit 
o 
A 


Figure 13.1 Loss per unit of debt as a function of the quasi-debt ratio d* 


$Discounting 
Premium=exp (-r«T) «mean (LossPayOff,1) ; 


rep=Premium; 


end 


Figure 13.1 plots the loss per unit of debt as a function of the quasi-debt ratio. Recall from 

Chapter 12 that the quasi-debt ratio is defined by: 
—rT F; 
gpa, 
Aj 

We begin the simulation with the following initial assets values A; (0) = A2(0) = A3(0) = $ 50. 
We compute the amount of debt using the quasi-debt ratio (which is equal for each firm). Then 
we run a simulation of 50 000 trajectories for each value of d* from 0.02 to 4.00. The result is 
plotted in Figure 13.1. 


13.1.3 Value of Guarantee Per Unit of Debt 


Since a private guarantor can default, the value of the guarantee at maturity is bounded by the 
value at maturity of the guarantor, Ag(T). Hence, the value of the private guarantee at maturity 
per unit of debt, G(T), will be the minimum of the expected loss and the residual value of the 
guarantor. 


N 
ne > max(0, F; — A(T) _Ag(T) 


N > ON 
i=l pas Fk pare Fk 


1 N 
“> nin() max(0, F; — Ai(T)), a); (13.11) 
k=1 k 


i=1 


270 Stochastic Simulation and Applications in Finance 


Again, using the equivalent martingale argument, we obtain the following equation for the 
present value of the guarantee: 


N 
; 1 
Gt) = E? | eT- —___ min | X` max(0, F; — A;(T)), Ac(T) J | - (13.12) 
Ži Fx i=l 


This expression can be interpreted as a complex option on a portfolio of puts and on the 
value of the guarantor. 

Given the curse of dimensionality due to the large number of underlying assets involved in 
the problem, we use the Monte Carlo simulations technique studied in previous chapters to 
evaluate these guarantees. 


Matlab Program 


The program simulating the guarantee per unit of debt is similar to the last one except for the 
payoff line. 


function rep=GuaranteePremium 

$Function computing the average guarantee premium in the case of 
sone guarantor and three firms 

SInitial parameters 

Tels 

NbStep=52; 

NbTraj=100000; 

r=0 05> 

mu=r; SInstantaneous return for the firms 
DeltaT=T/NbStep; 

%šInitial values of the firms 

A1=30«ones (NbTraj,1); 

A2=40«ones (NbTraj,1); 

A3=50«ones (NbTraj, 1); 

$Initial value of the guarantor 

AG=80xones (NbTraj, 1); 

$Construction of the covariance matrix. 

Rho12=0.1; 

Rho13=0.5; 

Rho23=-0.3; 

sigmal=0.2; 

sigma2=0.3; 

sigma3=0.5; 

sigmaG=0.25; 

MatrixCov=DeltaTs [1,Rhol2,Rho13,0;Rho12,1,Rho023,0;Rho13,Rho023,1,0; 
0,0,0,1]; 

MatrixL=chol (MatrixCov)'; 


2 


SFace value of the firms’ debts 

Debt1=20; 

Debt2=30; 

Debt3=30; 

$Vector of theoretical average of Brownian motions 
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0.35 


0.25; 


Average guarantee premium 


0 50 100 150 
Guarantor’s initial value 


Figure 13.2 Guarantee per unit of debt as a function of the guarantor’s asset value 


MatrixMean=zeros (4,1); 

$For each step, we simulate the firms’ prices and the 

guarantor’s value. 

for i=1:NbStep 
sUse of quadratic resampling technique. 
Temp=MatrixLsrandn (4,NbTraj) ; 
dW=ReQuadratic (Temp, MatrixMean,MatrixCov) '; 
A1l=A1. x+ (1+musDeltaTsxones (NbTraj,1)+SigmalsdW(:,1 
A2=A2.* (1+muxDeltaTsxones (NbTraj,1)+sigma2«dwW(:,2 
A3=A3.* (1+muxDeltaTxones (NbTraj,1)+sigma3«dW(:,3 
AG=AG. x (1+muxDeltaTxones (NbTraj,1)+sigmaGs«dW(:,4 


end 
$Computation of the the mean guarantee premium 
PremiumPayOff=min (max (0,Debt1-Al1) +max(0,Debt2-A2)+... 
max (0,Debt3-A3) ,AG) / (Debt 1+Debt2+Debt3) ; 
$Discounting 
Premium=exp (-r*T) «mean (PremiumPayOff,1) ; 
rep=Premium; 
end 


We run this program with initial assets values A;(0) = $50 and debt face value F; = $75. 
We vary the guarantor’s initial value from $ 1 to $ 150. The results are illustrated in Figure 
13.2. 


13.2 VALUATION OF CREDIT INSURANCE PORTFOLIOS 
USING MONTE CARLO SIMULATIONS 


This section is based on the work by Gendron, Lai and Soumaré (2006 a and b). The objective 
of this study is to investigate the effect of multiyear risk-management decisions on credit 
insurance portfolios. For this purpose, we contrast portfolio risk diversification with increased 
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insurer’s capital in a multiyear contractual environment. Using equivalent martingale measures 
(EMMs) in Harrison and Pliska (1981), we measure the value of credit insurance through the 
market value of vulnerable financial guarantees, which accounts for all shortfalls, clients’ and 
insurers’ specific risks, and capital as well as the correlations between the parties. 

From an insurance company’s viewpoint, portfolios of financial guarantees are complex 
because of their risk distribution structure and the systematic nature of credit risk, which 
underscores the importance of multiyear management. 

The model presented below captures different types of risks inherent to portfolios of credit 
insurances through stochastic cash flows of the policyholders, net-assets of the insurer, stochas- 
tic interest rate and stochastic volatilities. In addition, default before the maturity of the debt 
is triggered by missing on a coupon payment. 


13.2.1 Stochastic Processes 


We make the standard assumptions of no-arbitrage pricing of Merton (1974) and Harrison and 
Kreps (1979), i.e., perfect, frictionless markets in which securities are traded in continuous 
time, with no tax, no transaction costs, no asymmetric information, and so on. 

Assume a complete probability space (Q, F, Q) where F = {F;;t > 0} is the filtration, F, 
is the set of information available at time ¢ and Q the risk-neutral probability measure. 

We consider a portfolio of N credit insurance contracts. Each credit contract consists of 
insuring one debt. Let us denote by A; the cash flows available for loan repayment by the 
policyholder i (i = 1,..., N). The process describing the cash flows are 


dAj(t) =r(t)A;(t)dt + oj (t)A;(t)dWi(t), b= JaN, (13.13) 


where r is the short-term interest rate, {o;; i = 1,..., N} are the instantaneous volatilities of 
the cash flows’ returns described below (see equation (13.16) for the volatilities and equation 
(13.17) for the interest rate). (Wi: i = 1,..., N} are Wiener processes under the risk-neutral 
probability. 

Let ASSET be the total assets of the insurer and LIABILITIES the current existing liabilities. 
The total net-assets available to the insurance company to support credit insurance claims is 
denoted by Ag: 


Ag(t) = max(ASSET(t) — LIABILITIES(r), 0). (13.14) 


This equation links conceptually the insurer’s net-asset, capital, and insuring capacity. It is 
assumed to follow the Ito diffusion process 


dAg(t) =r(@t)Ag(t)dt + og(t)Acg(t)dWo(t), (13.15) 


where og is the instantaneous volatility of the insurer’s total net-assets described below in 
equation (13.16) and We a Brownian motion under the risk-neutral probability. 

The volatilities of the cash flows {A;, i=1,...,N} and the insurer’s total net-assets (Ag) 
are stochastic and their processes are given as follows: 


dop(t) = ve(Be — of (1) dt + pror orod Wilt) + 1 — 02, dWi(0)), (13.16) 
fork = 1,..., N, G, where Wi, k= 1,..., N, G are Wiener processes. The coefficients {vz, 
Br, k; k= 1,..., N, G} are respectively the speed of adjustment, the long-run mean and 


the variation of the diffusion volatility, and the constants {pzo; k = 1,..., N, G} represent 
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the correlations between the variables and their volatilities. Note that the volatility risk is 
rewarded in the sense that it earns a risk premium and it is incorporated into the stochastic 
process through the speed of adjustment parameter. 

The dynamic of the stochastic interest rate is 


dr(t) = «(0 = rdt + o,r(t)’dW,(t), (13.17) 


where « is the speed of adjustment, @ is the long-run mean. ø, is the volatility of the interest 
rate changes which is set to a constant, and W, is a Wiener process. Note that, as discussed in 
Chapter 11, the interest rate risk is rewarded by way of its market price of risk. Here, this is 
implicitly reflected in equation (13.17) and adjusted through x. 

The Cox, Ingersoll and Ross (CIR, 1985b) and Vasicek (1977) interest rate processes are 
particular cases of this process, with y equal to 0.5 for the CIR (1985) process and 0 for the 
Vasicek (1977) process. 

The Brownian motions satisfy the following conditions 


E2[dW,(t)dW,(t)| = pudt, Vk,le{1,...,N,G,r}, kÆl, (13.18) 
EL id WOA =0, Wk, D € {1,..., N,G,r} x {1,...,N,G}, (13.19) 
ELWA] =0, Yk, = {1,...,N,G}, kl. (13.20) 


13.2.2 Expected Shortfall and Credit Insurance Valuation 


The debt underwritten on the contract of policyholder i is represented by a coupon paying 
bond with coupon C; at each interval At of time and face value F; at maturity date T;. The 
coupon payments are made semiannually and subtracted from the policyholder’s cash flows’ 
value denoted by A;. The insurer seizes the policyholder’s remaining cash flows at the first 
default date. 

Again, we use a no-arbitrage argument similar to Harrison and Kreps (1979) and Harrison 
and Pliska (1981); therefore the debts are priced using the equivalent martingale measure 
(EMM) @ as follows. 

In absence of default, the bond price at any time T is 


“at ttkAt T 
D,(t) = E£? X exp (-f rods) C; + exp (-/ rods) F; |. (13.21) 


From this formula, the default-free coupon bond price at time t is the expected present value 
of all future payments until maturity. The first part of the expectation expression is the present 
value of all remaining coupons and the second part is the present value of the final payment. 

The policyholder i defaults on its debt service if the cash flow A; drops below the required 
payment. This occurs at the first exit time T;: 


T = inf{t such that A;(t) < C; or Aj(t) < Ci + F; if t = Tj}. (13.22) 


Hence, the price of the risky debt at time zero without guarantee, Dj G(0), is 


k 


(1-1 


kAt Tj 
Dina) = Eg | X. exp (- f rods) Ci + exp (- f rods) Ai (ti) 
k=l 0 0 

(13.23) 
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Intuitively, from equation (13.23), in the absence of a credit insurance, all future payments 
are lost upon default at time t;. The risky debt price is the expectation (under the risk-neutral 
probability Q) of the present value at time zero of all coupons received prior to default plus 
the value A;(t;). Here, the default trigger date is random and can be any coupon date. For a 
zero-coupon bond, the default trigger date has no random component, corresponding exactly 
to the maturity date of the debt. 

Credit insurance enhances the value of the risky debt. The price at time zero of the insured 
debt, D;,g(0), is 


D;,g(0) = Ding (0) 


+ Ee [exp (- f rods) min(@; Ag(t;), Di(ti) — A)| » (13.24) 
0 


where œ; is the allocation percentage of the insurer’s wealth to policyholder i. 

In equation (13.24), the price of the insured debt is equal to the price of the risky debt without 
credit insurance plus an additional amount representing the value of the credit insurance. Upon 
default at time t;, if the insurer’s contribution (@;AgG(t;)) combined with the value of the cash 
flows (A;(T;)) is sufficient to cover all remaining payments on the debt, then the lender receives 
full payment. Otherwise, the insurer can only provide partial payment. 

The total expected shortfall on the loan is measured by the difference between the value of 
the default-free debt and the noninsured debt: D;(0) — Ding(0). In the spirit of Merton (1977), 
the difference between the default-free debt and the insured risky debt comes from the fact 
that insurance companies can default. Some government debt can practically be considered as 
default-free. The value of credit insurance is the difference between the insured risky debt and 
the noninsured risky debt: 


Di,g(0) — Ding (0) = EF [ex (- i i ris\) 
0 


x min(a; Ag(7;), D;(t;) = A)| s (13.25) 


In a portfolio context, our key variables are the total expected shortfall (ES) and the credit 
insurance (CI) per unit of total risk-free debt value and are given by 


ED: O — Ding) 
Eo D: (0) 


ES = (13.26) 


yy (Dig) — Di nc (0)) 


CI = 
X; o D: (0) 


(13.27) 


ES captures not only the stochastic cash flows and insurer’s net-assets risks, but also the 
interest-rate risk and the correlation risks among these sources of uncertainty. The difference 
between ES and CI represents what is not covered by the risky insurer. For a riskless insurer, 
e.g., government corporations, this difference is zero. 
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13.2.3 MATLAB Program 


Here is the MATLAB program to value portfolios of credit insurances. 


function Guarantee (NbCoupons, VG, d) 

$Function simulating the guaranteed debt and debt without guarantee 

$for three firms. 

SNbCoupons: Number of coupons of the debts. 

SAG: Initial value of the guarantor. 

$d: Quasi debt ratio initial for all firms. 

$Initial parameters 

T=NbCoupons/2; 

NbSteps=NbCoupons ; 

NbTraj=5000; 

DeltaT=T/NbSteps; 

SCholesky decomposition. 

MatrixL=CreateMatrixChol (DeltaT) ; 

$Vectors keeping information on different 

sdebts for computing the mean on all trajectories. 

DebtNoGuaranteeMean=zeros (3,1); 

DebtGuaranteeMean=zeros (3,1); 

DebtNonRiskyMean=zeros (3,1); 

sVectors keeping information on total loss 

expected and credit insurance. 

Es=zeros (NbTraj,1); 

Ci=zeros (NbTraj,1); 

$For each trajectories we compute the debts’ values 

for j=1:NbTraj 
%We initialize to 0 the information vectors 
[r,A, Sigma, SigmaGuarant ,Default,ValueDefault,... 
ValueDefaultGuarant, RateBeforeDefault,RateAfterDefault,... 
ValueDebt, ValuePaid, DebtNoGuarantee,DebtGuarantee]=... 
InitializeToZero; 
sSimulation of brownian motion, interest rate and variance. 
dW=GenerateVariable (MatrixL,NbSteps) ; 
r=SimulationRates (dW,r,NbSteps,DeltatT) ; 
Sigma=SimulationSigma (dW,Sigma,NbSteps,DeltaT) ; 
sComputation of debts’ value by discounting 
Debt=Axdxexp (sum(r) *«DeltaT) ; 
sComputation of the guarantor allocated to each debt. 
alpha=ComputeAlpha (Debt) ; 
SCoupon rate of debts. 
Coupon=[5;5;5]; 
SInitialization of the guarantor’s value. 
Guarant=AG; 
sComputation of the risk free debt 
DebtNonRisky=ComputeDebtNonRisky (r,Coupon,Debt,DeltaT, 

NbSteps) ; 

SAddition of non risky debt to computation of the mean 
DebtNonRiskyMean=DebtNonRiskyMean+DebtNonRisky; 
SFor each step, we simulate firms’ defaults and store 
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information. 
for i=1:NbSteps-1 
sVector storing the discounting rate from time 0 until 
default. 
RateBeforeDefault=RateBeforeDefault+DeltaT«r(1,i) 
x (ones (3,1) -Default); 
$Simulation of firms and guarantor’s value 
A = max(A.x*(1+r(1,i)*DeltaT + Sigma(1:3,i).*dW(1:3,1i)),0); 
Guarant=Guarants (1+r(1,1i) *«DeltaT+Sigma(4,i)*«dW(4,1)); 
sTemp keeps in memory firms defaulting at time 
Temp= (A<Coupon) .* (1-Default) ; 
$Vector keeping in memory the firm's value at default time 
ValueDefault=ValueDefault+Temp.xA; 
$Vector keeping the guarantor’s value when a firm defaults 
ValueDefaultGuarant=ValueDefaultGuarant+alpha.»«Guarant.*Temp; 
$Vector keeping in memory which firm defaulted since 
time 0. 
Default=Default |Temp; 
Vector keeping the value of remaining debt a time of 
default. 
ValueDebt=ValueDebt+Temp.«ComputeValueDebt (i,NbSteps,r, 
Debt, Coupon, Temp.+A,DeltaT) ; 
$Vector keeping the value paid by guarantor when a default 
occurs. 
ValuePaid=ValuePaid+min(Guarantsalpha, ValueDebt) .+«Temp; 
sAmount paid are deducted from the guarantor assets’s value 
Guarant=Guarant-sum(ValuePaid.*Temp) ; 
sComputation of debt without guarantee 
DebtNoGuarantee = DebtNoGuarantee + 
exp (-RateBeforeDefault) .*min(Coupon,A) .* (1-Default+ 
Temp) ; 
$Initialization of debt of the firms in default 
Debt=Debt -Debt .*Temp; 
sComputation of new alpha 
alpha=ComputeAlpha (Debt) ; 
sComputation of firms values when paying coupon. 
A=A-min(Coupon,A) ; 
end 
$For the last step, we make computation outside principal loop. 
RateBeforeDefault=RateBeforeDefault+ 
DeltaTsr(1,NbSteps) « (ones (3,1) -Default) ; 
A = max(0,A.«*(1+r(1,NbSteps) *«DeltaT + Sigma(1:3,NbSteps) .«*... 
dW(1:3,NbSteps) )); 
Guarant=Guarants (1+r(1,NbSteps) «DeltaT+Sigma(4,NbSteps)+*... 
dw(4,NbSteps) ) ; 
Temp= (A<Coupon+Debt) .* (1-Default) ; 
ValueDefault=ValueDefault+Temp. xA; 
ValueDefaultGuarant=ValueDefaultGuarant+Guarant.«Temp; 
Default=Default | Temp; 
DebtNoGuarantee = DebtNoGuarantee + 
exp (-RateBeforeDefault) .*min(Coupon+Debt,A) .« (1-Default+Temp) ; 
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ValueDebt=ValueDebt+Temp. x (Coupon+Debt -A) ; 
ValuePaid=ValuePaid+min(Guarantxalpha, ValueDebt) .*Temp; 
sComputation of guaranteed debt’s value 
DebtGuarantee=DebtGuarantee+exp (-RateBeforeDefault) .*«ValuePaid; 
SDeducting the amounts paid from the firms’ 
A=A-min(Coupon+Debt ,A) ; 


SComputation of 


values 


expected loss and credit insurance. 

Es(j) = sum(DebtNonRisky-DebtNoGuarantee) /sum(DebtNonRisky) ; 
Ci(j) = sum(DebtGuarantee) /sum(DebtNonRisky) ; 

SAddition of the debt guarantee and debt non guarantee for the 
mean. 
Debt NoGuarant eeMean=DebtNoGuaranteeMean+DebtNoGuarantee; 
Debt Guarant eeMean=DebtGuaranteeMean+DebtGuarantee; 


end 


$Computation of expected loss and credit insurance for 


computing the mean. 
EsMoy=mean (Es) 
CiMoy=mean (Ci) 
end 


function MatrixL=CreateMatrixChol (DeltaT) 


$Function computing Cholesky decomposition 


of the covariance 


matrix. 
SCorrelations 
Rho12=0.1; Rhol3=0.5; Rho23=-0.3; 
Rho1G=0.2; Rho2G=0.2; Rho3G=0.2; 
Rholr=0.1; Rho2r=0.1; Rho3r=0.1; 
RhoGr=0.1; 


end 


function d 


sConstruction of the covariance matrix 

MatrixCov=DeltaTx [1,Rho12,Rho013,Rho1G, 
Rho12,1,Rho23, Rho2G, Rho2r, 0, 0, O, OQ; 
Rho13,Rho23,1,Rho3G, Rho3r, 0, 0, 0, Q; 
Rho1G, Rho2G, Rho3G, 1, RhoGr, 0, 0, 0, Q; 
Rholr, Rho2r, Rho3r, RhoGr, 1, 0O, 0O, 0, OQ; 
070,.0:,.0,, 0:,.1.,:05-0:,/0% 
050.;-0:7.05'0.,-0:7. 2 :04-0% 
0,0,0,0,0,0,0,1,0; 
0,0,0,0,0,0,0,0,1]; 

Cholesky decomposition 

MatrixL=chol (MatrixCov) ‘; 


=GenerateVariable (MatrixL,NbSteps) 


$Function generating brownian motions. 


S$MatrixL: 


Cholesky decomposition matrix 


SNbSteps: Number of time steps. 


end 


function 


dW=MatrixLs«randn(9,NbSteps) ; 


r=SimulationRates(dW,r0, NbSteps,DeltaT) ; 


$Function simulating the interest rate. 
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%dW: Brownian motion 
Sr0: Interest rate at time t=0. 
SNbSteps: Number of time steps. 


SDeltaT 
r=zeros (1,NbSteps) ; 
0151) =r0e 


$Simulation of interest rate avoiding negative rates. 
for j=1:NbSteps-1 
v(1,j+1) =max(r(1,j)+0.02%(0.05-r(1,j)) *«DeltaT+ 
O.2*r(1,3)7(0.5)*dW(5,5),0); 
end 
end 


function Sigma=SimulationSigma (dW, Sigma0,NbSteps, DeltaT) 
$Function simulating the firms’ volatilities. 
SdW: Brownian motions 
$SigmaO: Initial volatility 
SNbSteps: Number of time steps 
sDeltaT 
sParameters 
nu=0.5; 
beta=Sigma0.%2; 
phi=0.15; 
rho=0.1; 
Sigma=zeros (4,NbSteps) ; 
Sigma(:,1)=Sigma0; 
$Simulation of volatilities avoiding negative volatilities. 
for j=1:NbSteps-1 
Sigma (:,j+1)=(max(0,Sigma(:,j) .*2+nux (beta- 
Sigma(:,j).*2)*DeltaT+ 
phis«Sigma(:,j) .«* (rhoxdW(1:4,3)+ 
(1-rho) *0.5*dW(6:9,j3)))).%0.5; 
end 
end 


function Alpha=ComputeAlpha (Debt) 
$Function computing the guarantor’s proportion allocated to each 
debt 
SDebt: Remaining debts vector. 
$Computation of alpha avoiding dividing by zero. 
if sum(Debt,1)~=0 
Alpha=Debt/(sum(Debt,1)) ; 
else 
Alpha=0«Debt ; 
end 
end 


function ValueDebt=ComputeValueDebt (i,NbSteps,r,Debt,Coupon,A, 

DeltaT) ; 
S$Function computing the debt’s value and remaining debt when 
there is 
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$default before maturity. 

$i: Time of default 

SNbSteps: Number of time steps 

$r: Risk free interest rate 

SDebt: Firms debts’ values 

sCoupon: Each debt coupon 

SA: Firms’ values 

$DeltaT 
%The remaining debt comprises the coupon’s unpaid part 
ValueDebt=Coupon-A; 
$Addition of coupons that would have been paid if there were... 
% no default. 
for j=i+1:NbSteps-1 

ValueDebt=ValueDebt+exp (-sum(r(i+1:j))*DeltaT) «Coupon; 

end 
$Addition of last payment comprises the facial value. 
ValueDebt=ValueDebt+exp (-sum(r(i+1:NbSteps) ) *DeltaT) x (Coupon+ 
Debt) ; 

end 


function DebtNonRisky=ComputeDebtNonRisky (r, Coupon, Debt, 
DeltaT,NbSteps) 
$Function computing the debts’ value if they were risk free. 
sx: Interest rate 
sCoupon: Coupon rate 
sDebt: Face values of debts. 
$DeltaT 
SNbSteps: Number of time steps. 
$Initialization of the risk free debt. 
DebtNonRisky=zeros (3,1) ; 
$For each coupon, we discount at risk free rate. 
for j=1:NbSteps-1 
DebtNonRisky=DebtNonRisky+exp (-sum(r(1:j)) *«DeltaT) «Coupon; 
end 
SAddition of last coupon and face value of debt. 
DebtNonRisky=DebtNonRisky+ 
exp (-sum(r(1:NbSteps) )*DeltaT) x (Coupon+Debt) ; 


end 


function [r,A,Sigma,sSigmaGuarant,Default,ValueDefault,... 
ValueDefaultGuarant, RateBeforeDefault,RateAfterDefault,... 
ValueDebt , ValuePaid, ValueNoGuarantee, DebtGuarantee] =Initialize 
ToZero 
$Function initializing parameters and vectors to their initial 
values. 
r=0.05; 
A=[50;50;50]; 
Sigma=[0.2;0.3;0.5;0.25]; 
sigmaGuarant=0.25; 
Default=zeros (3,1); 
ValueDefault=zeros (3,1); 
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ValueDefaultGuarant=zeros (3,1); 
RateBeforeDefault=zeros (3,1); 
RateAfterDefault=zeros (3,1); 
ValueDebt=zeros (3,1); 
ValuePaid=zeros (3,1); 
ValueNoGuarantee=zeros (3,1); 
DebtGuarantee=zeros (3,1); 

end 


We use the program to compute the total expected shortfall or total loss per unit of debt ES 
and the value of the credit insurance per unit of debt CI. Figure 13.3 plots these values as a 
function of time to maturity T. 

Since the guarantor has limited wealth, we can see on the graph the maximum level of its 
insuring capacity. Indeed, when maturity increases, the number of coupons increases and the 
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Figure 13.3 Total loss and credit insurance as a function of maturity 
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guarantor hits its insuring capacity limit. After this maximum insuring capacity, the value of 
the credit insurance per unit decreases since the credit insurance value is at its maximum and 
at the same time the risk-free debt continues to increase, which decreases the credit insurance 
per unit of debt. 

We present a second graph that shows the evolution of ES and CI as a function of the 
quasi-debt ratio. The initial asset values for the firms are $ 50, the initial net-asset value of the 
guarantor is $ 100, the maturity of the debts is 5 years and coupons of $5 are required from 
each firm every six months. With these parameters values, we obtain Figure 13.4. 


Notes and Complementary Readings 


Complementary readings on financial guarantees are Merton (1974, 1977), Merton and Bodie 
(1992), Lai (1992), Lai and Gendron (1994), Lai and Yu (1999), Gendron, Lai and Soumaré 
(2002, 2006 a and b), Lai and Soumaré (2005) and Angoua, Lai and Soumaré (2008). 
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14 


Risk Management and 
Value at Risk (VaR) 


Risk management is very important for both individuals and corporate entities (e.g., institu- 
tional investors, pension funds, insurance companies, asset managers and, particularly, banks). 
Under its widest scope, risk management includes market risk, liquidity risk, credit risk, in- 
surance risk, operational risk, legal risk, and so on. 

Nowadays, we observe that financial and non financial institutions own complex portfolios 
composed of many financial contracts (for example, futures, options, swaps and other complex 
derivatives) and non financial products (for example, insurance contracts, contractual clauses, 
and structured finance). Moreover, the different financial disasters (Barings, Bear Stearns, 
Enron, Long Term Capital Management, Orange County, WorldCom, etc.) that occurred 
during recent decades have brought to the fore the crucial necessity and importance for all 
parties to implement efficient methods for measuring and controlling risk in order to sustain 
the firm’s competitive advantage, and to assure the survival of institutions and the stability of 
the financial systems. All stakeholders have an interest in monitoring risks in the economy 
since the financial health of the economy depends on it. 

Given that managers like simple concepts and measures that provide a global idea of all 
the risks their institutions face, the “snapshot” Value at Risk (VaR) fits the bill. Indeed, in 
order to quantify and manage risks, many institutions use the Value at Risk. The Value at Risk 
is defined as an aggregate risk measure which gives the expected extreme loss of owning a 
portfolio or an asset for a given period of time given a specific confidence level. The VaR gives 
a unique value reflecting the risk level of a portfolio when considering the financial leverage, 
the diversification and the multidimensional nature and interdependence of risks. 

Computation approaches for the VaR can be classified into two groups: parametric ap- 
proaches and non parametric approaches. However, the VaR presents many limitations which 
can be circumvented by conditional VaR models. In this chapter, we limit ourselves to a pre- 
sentation of the basic elements related to the computation of the market risk VaR (Market VaR 
and no other risk such as credit risk VaR or CreditVaR and operational risk VaR or OpVaR) 
using Monte Carlo simulations. 

The remainder of this chapter is structured as follows. We begin by describing each type 
of risk. Then we give a formal definition of the VaR computation. After this, we discuss 
the Basle regulatory environment and the concepts of stress testing and back testing. Then 
we describe the different VaR computation approaches. Finally, since our objective is to 
provide ingredients for the VaR computation using Monte Carlo simulations, we give two 
examples of such calculations. 

In Chapter 8, we characterized the dynamics of asset prices. In Chapter 4, we showed how 
to generate correlated random variables. We apply those concepts studied in Chapters 4 and 8 
in the context of VaR computations. 

The next chapter gives examples where we apply the Quasi Monte Carlo simulation tech- 
niques and the Principal Components Analysis technique to evaluate a bond portfolio’s VaR. 
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14.1 TYPES OF FINANCIAL RISKS 


We present a very brief and non exhaustive taxonomy of the risks that both financial and non 
financial institutions have to face. 


14.1.1 Market Risk 


Market risk corresponds to the risks generated by market variables’ fluctuations such as 
interest rates, exchange rates, stock prices, and commodity prices on the financial results and 
the economic value of an institution or a portfolio of assets. 


14.1.2 Liquidity Risk 


The goal of liquidity risk management is to ensure that a financial institution is able to obtain 
rapidly and efficiently all monetary amounts to meet its short term obligations even during a 
crisis. Liquidity risk management is essential to ensure the confidence of clients, suppliers, 
depositors and creditors as well as the survival of the institution. 


14.1.3 Credit Risk 


Credit risk is the risk that a counterparty (a borrower, an endorser, an issuer, a guarantor 
or an institution) defaults on its commitments (for example principal or interest payments). 
For banks, this risk represents a major source of risk and a significant part of their capital 
requirements. 


14.1.4 Operational Risk 


Operational risk can be defined as the financial loss or loss in terms of public image following 
malfunctionings or inefficiencies of organizational, procedures and logistics. It includes losses 
resulting from defaults or any inefficiencies of internal processes and systems, individuals, 
external events, defaults in the command chain, bad supervision, human errors and frauds. 


14.2 DEFINITION OF THE VALUE AT RISK (VaR) 


The first commercial model for computing VaR was launched in 1994 by J.P. Morgan. VaR, 
an aggregate measure of risk, can be defined as the expected extreme loss emerging from the 
ownership of a portfolio or an asset during a specific period of time given a specific confidence 
level. Generally, the time period is one or 10 days. The VaR provides a unique value reflecting 
the risk level of the portfolio taking into account financial leverage and diversification. 

For example, if a portfolio has a daily VaR of $ 10 million with a confidence level of 99%, 
it means that the total potential loss on this portfolio will be on average more than 10 million 
one day out of 100 days of successive transaction days, or two or three days during the year. 
Using VaR, we do not answer the question “How much could I lose on my portfolio during 
a specific period of time?” but “What will be the maximal loss on a given period with a 
small probability (for example 1%)?”. In other words, VaR models developed by banks try to 
measure the maximum potential loss for a fixed probability on a given period of time. The 
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Figure 14.1 Value at Risk (VaR) with a = 1% 


VaR of a market position is thus a sole number measuring and summarizing this position’s 
risk for a given confidence level. 

If we denote by AP the random variable representing the variation in the portfolio value 
(variation for a period of j days), then we determine a number or VaR, such that: 


Prob(AP < —VaR) = a, (14.1) 


with a = 1%, 5% or 10%. We clearly see that the VaR is the percentile of order œ x 100. In 
this case 1 — œ represents the VaR’s confidence level. Figure 14.1 provides an illustration. 


14.3 THE REGULATORY ENVIRONMENT OF BASLE 


We offer below a general overview of the different Basle accords regarding the use of VaR. 

In 1988, the Basle committee introduced the Basle Capital Accord I with the major impetus 
from the Bank of International Settlement (BIS) to fix a minimum equity capital requirement 
for the banks (8% for corporate loans and 4% for non-insured residential mortgages) in order 
to avoid the risk of insolvency. 

In 1996, this same committee approved an approach allowing banks to use their own models 
of internal risk management but still in conformity with specific quantitative and qualitative 
criteria defined by the Basle Committee on Bank Supervision. Table 14.1 presents the main 
quantitative and qualitative criteria that internal VaR models have to comply with. 


Table 14.1 Quantitative and qualitative criteria for internal VaR models 


Quantitative criteria Qualitative criteria 
e VaR must be computed daily on a time horizon e Perform stress-tests for their portfolio to 
of 10 days. measure the impact of extreme market 
e Confidence level is 99%. conditions. 
e These models must take into account the e Must have independent risk management units 
impact of time variations of volatilities on directly linked to the top management. 
derivative products prices. e Analyze internal risk management systems 


using internal and/or external auditors. 
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The New Basle II accord follows the same objectives as the Basle I accord with the addition 
of new guidelines for the treatment and control of operational risk. The Basle II accord is 
based on the following three pillars: 


Pillar 1: Minimal capital requirement derived from the composition of risks of all assets 
and liabilities: risk-weighted assets are the sum of assets weighted by market, credit and 
operational risks. 

Pillar 2: Supervisory review process ensures that banks follow rigorous procedures, 
measure their risk exposures correctly, and have sufficient capital to cover their risks. 
Pillar 3: Market discipline introduces a radical change which imposes the disclosure of 
information on risks to investors in order to apply a market discipline that discourages 
inappropriate and excessive risk taking. It allows the disclosure of information, as well 

as the transparency and viability of the financial system. 


To gain control over their risks and to follow regulatory directives, Basle allows institutions 
to use their own internal VaR models but they must perform the following two tests, stress 
testing and back testing, which we briefly describe in the next two sub-sections. 


14.3.1 Stress Testing 


This approach is being used to partially answer some of the following questions: “What will 
happen in extreme market conditions?”; “What will be the probable loss level when the VaR 
limit is reached?” We can subdivide the stress testing process into three principal steps. 


1. The process begins with a set of extreme market scenarios. These scenarios are based on 
the historical crises that the market has gone through (Asian crisis, Russian crisis, etc.), or 
are simply hypothetical, such as supposing a market movement equivalent to 10 times its 
standard deviation. 

2. For each scenario, we evaluate the variation in the portfolio’s value. 

3. We prepare a report of the principal results summarizing the gains and losses for each 
extreme scenario and point out sectors and industries that would be more affected. 


14.3.2 Back Testing 


Back testing is a procedure used to validate the efficiency of internal VaR computation systems. 
It can be defined as a simple statistical technique which determines the number of times that 
real losses exceeded the levels predicted by the VaR. 

It consists of comparing the VaR distributions with realized losses where the elements to 
consider are: (i) Find the value of cash flows to be compared with the VaR. The VaR must be 
compared to static profits and losses, since the VaR supposes a static position over one or 10 
days. (ii) Test the model validity: it consists of computing the probability frequencies of losses 
greater than the estimated level by the VaR model. 


14.4 APPROACHES TO COMPUTE VaR 


AS we mentioned in the introduction to this chapter, many approaches exist to compute a VaR. 
Here we present some of these approaches. 
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14.4.1 Non-Parametric Approach: Historical Simulations 


The non-parametric VaR is obtained from the distribution of historical data and doesn’t 
necessarily imply the computation of parameters from a theoretical distribution. 

This approach relies on the use of past daily series of risk factors influencing the portfolio 
value, and valuation model or price function linking the variations of the portfolio’s value 
to the risk factors’ variations. It is then possible, from the portfolio’s actual composition, to 
simulate the empirical distribution of the portfolio’s change in value, and thus to obtain the 
VaR. Although this approach is easy to understand, it suffers from many limitations such as: 


e the assumption that the past predicts the immediate future; 

e data sample size that require a lot of observations; 

e considerable results sensibility to the time horizon and the time period; 
e consistency of collected data. 


14.4.2 Parametric Approaches 


The parametric approach supposes that returns are derived from a given theoretical distribution, 
like a Gaussian distribution. For this class of approach, we present the delta-normal or variance- 
covariance method and the Monte Carlo simulations method. 


Delta-normal method or variance-covariance method 


This is a parametric method which uses historical data to determine the market returns’ distri- 
bution. Among the advantages of this approach, we note that it is flexible and simple. However 
this approach suffers from weaknesses related to the following restrictive assumptions: 


e the linearity of portfolios’ returns; 

e the normality of market factors changes; 

e the existence of a covariance matrix that characterizes historical data. However, it is often 
hard to estimate and forecast these matrices. 


The assumption of the normality of short-term returns is generally not satisfactory in spite of 
its simplicity of computations. Indeed, these distributions are often leptokurtic, which implies 
more frequent extreme events. Moreover, methods based on the covariance underestimate a 
portfolio’s risk when the distribution is flatter and highly skewed to the left. 

A practical application of the variance-covariance method is the commercial model Risk- 
Metrics developed by J.P. Morgan and made public in 1994. This model postulates the shape of 
the risk factors’ distribution. Once this is defined, it is necessary to determine the risk factors’ 
variance-covariance matrix on the historical observation period. 

The delta-normal method can be improved to take into account the convexity of returns 
using an extension of this method: the delta-gamma method. 


Monte Carlo simulations method 


The risk factors’ scenarios are directly taken from the past in the variance-covariance approach 
and in the historical simulations approach, but in the Monte Carlo approach, these are simulated 
through the mathematical modeling of a stochastic process for each factor. 
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In practice, portfolios include several financial assets which are themselves influenced by 
many financial risk sources. Even relatively simpler assets such as bonds can depend on a 
combination of two or more stochastic variables. 

In the following, we describe the Monte Carlo simulations method for the case which 
includes many sources of risk. We also provide two application examples of this approach. 


14.5 COMPUTING VaR BY MONTE CARLO SIMULATIONS 
14.5.1 Description of the Procedure 


We consider a portfolio P composed of N assets of prices X;, j = 1,..., N. The description 
of the asset prices’ processes is based on the stochastic differential equations theory seen in 
Chapter 8. We assume the asset prices processes to follow an Ito process defined as: 


AXO = X(t — (uj At +o AZO), j=l,....N, (14.2) 


where the values of Z; are taken from a standard Gaussian distribution. 

In Chapter 4, we described how to generate correlated random variables. We will apply 
concepts seen in that chapter to the VaR computation context. Indeed, if variables Z; are not 
correlated between themselves, then the sampling can be generated independently for each 
variable. But generally, the variables are correlated, and to consider these correlations, we 
begin by generating a set of variables Y which will then be transformed into Z. We described 
the process of transforming variables Y into Z in Chapter 4. This transformation procedure 
requires the Cholesky decomposition and/or the eigenvalues decomposition. 

To compute our portfolio’s VaR, we perform a large number of simulations, for example 
100 000 simulations (the number of simulations depends on the desired precision) of the prices 
of the assets for the VaR computation period. For each simulation, we compute the portfolio’s 
value. Thus, we obtain 100 000 simulated values for our portfolio. Finally, for a given level of 
confidence 1 — a = 99% (that is œ = 1%), the VaR is the value of the portfolio exceeded by 
99% x 100000 simulated values or the 1% x 100000" smallest value. 

We note that the procedure used in VaR estimation by Monte Carlo simulations is similar to 
the one employed for complex options valuation, but in the VaR case, there is no discounting. 


14.5.2 Application: VaR of a Simple Bank Account 


This example is inspired by Marrison (2002), but with modifications. We suppose that our 
savings account balance is described by the following stochastic process 


0.1 0.05 0.1 
P(t + At) = P(t)+ P(t) (o te gaz” ® E gogo ® ae 200) At, (14.3) 


where Zy follows a Gaussian distribution N(O, 1). 73m and rj, are respectively three month 
and one year interest rates driving this process. We suppose then that our account’s value is 
explicitly affected by the three month and one year interest rates. 

Our objective is to compute the VaR of this bank account. We therefore characterize the 
term structure of interest rates as studied in Chapter 11. For this purpose, we assume that the 
three month and one year interest rates are correlated as well as correlated with the five year 
interest rate. We respectively consider the following processes for the three month, one year 
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and five year interest rates: 
r3m(t + At) = r3m(t) + 0.25(0.035 — r3m(t))At + 0.07(r3m(t))°°V AtZam(t), (14.4) 
riy(t + At) = riy(t) + 0.26(0.040 — riy (t) At + 0.05071, (0))° VAt Zit), (14.5) 
rsy(t + At) = rsy(t) + 0.27(0.060 — rsy(t))At + 0.05(r5)(t))° > VAtZsy(t), (14.6) 


where variables Z3m, Z1y, Z5y, drawn from a Gaussian distribution N(0, 1), satisfy the following 
correlation structure: 


P3m,1y = COIT(Z3m, Ziy) = 0.10, (14.7) 
P3m,5y = COM(Z3m, Zsy) = 0.20, (14.8) 
Ply.sy = Corr(Ziy, Zsy) = 0.50. (14.9) 


We note that the interest rate processes follow mean reverting processes, for example the Cox, 
Ingersol and Ross (1985) process described in Chapter 11. 


Simulating the interest rates processes 


With initial values, 73m(0) = 0.035, r1,(0) = 0.040, 75,(0) = 0.060, simulate these three interest 
rates on a monthly basis for a 3 year period. We must note that in this exercise, we generate the 
term structure of interest rates for a horizon of 3 years. However, the VaR will be computed 
according to a shorter horizon. Here is the code used to obtain the simulation results. 


function RatesGenerate 
sFunction generating interest rates for 3 months, 1 year 
sand 5 years satisfying the specified correlations. 


S$Initial parameters 
T=3; 

NbSteps=36; 
DeltaT=T/NbSteps; 
sqDeltaT=sqrt (DeltaT) ; 


$Cholesky decomposition of the covariance matrix. 
CovTheo=[1,0.1,0.2;0.1,1,0.5;0.2,0.5,1]; 
MatrixL=chol (CovTheo) ’; 


SInitial rates 
r0=[0.035;0.04;0.06]; 


$Vector storing the historical rates 
r=cat (2,r0,zeros(3,NbSteps) ) ; 


SRates dynamic parameters 
Speed=[0.25;0.26;0.27]; 
Factor=[0.07;0.05;0.05]; 
Exponent=[0.5;0.25;0.25]; 


%šLoop simulating the rates for each time 
for i=1:NbSteps 
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Figure 14.2 Possible scenario for the interest rates evolution 


dW=MatrixL*randn (3,1); 
r(:,i+1)=r(:,1i) + Speed.*(r0-r(:,i))*DeltaT + 
Factor.*r(:,i).*Exponent*sqDeltaT.*dw; 
end 


sWe plot the graphs for the rates. 
plot ([0:DeltaT:T] ,r) 


end 


One possible scenario for the interest rates’ paths is presented in Figure 14.2. 


Computing the VaR of a bank account 


We are required to compute the VaR of this bank account for a horizon of | year with a 
confidence level of 99%. 

To compute the VaR, we simulate M scenarios for the bank account from equation (14.3) 
with the interest rates simulated over one year. Then, we compute the VaR by taking the 
0.01M"™ smallest value. In Table 14.2 we present the VaR results and the estimation error for 
M = 1000, 10000, 100.000, 1 000 000. 


Table 14.2 99% Value at Risk for a bank account 


M VaR ($) Error ($) 
1000 33.082 0.032 
10.000 35.779 0.010 
100 000 35.336 0.003 


1 000 000 35.261 0.001 
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Here is the program that has been used to obtain these results. 


function VaRAccount (N) 
SFunction computing the value at risk for one year of a bank ac- 
count. 
SN: Number of simulations to do 
$Vector storing the balance in the account for all the simulations. 
Account=zeros(N,1); 
S$Initial balance in the account. 
InitialBalance=100; 
$For each simulation we compute the account’s balance 
for j=1:N 
Balance=InitialBalance; 
Account (j,1)=BalanceAccount ; 
end 
SSorting of the balances 
Account=sort (Account) ; 
$Value at risk computation. 
VaR=mean (Account) -Account (floor(0.01*N) ,1); 


function rep=BalanceAccount 
sFunction generating the balance in an account 

SInitial parameters 

T=1; 

NbSteps=12; 

DeltaT=T/NbSteps; 

sqDeltaT=sqrt (DeltaT) ; 

$Cholesky decomposition of the covariance matrix. 

CovTheo=[1,0.1,0.2;0.1,1,0.5;0.2,0.5,1]; 

MatrixL=chol (CovTheo) ’; 

SInitial interest rates 

r0=[0.035;0.04;0.06]; 

$Vector storing the historical rates 

r=cat (2,r0,zeros(3,NbSteps) ); 

sParameters of the rates’ dynamics 

Speed=[0.25;0.26;0.27]; 

Factor=[0.07;0.05;0.05]; 

Exponent=[0.5;0.25;0.25]; 

$Loop generating the interest rates for each step 

for i=1:NbSteps 
dW=MatrixL*randn (3,1); 
r(:,1+1)=max(0,r(:,1i) + Speed.*(r0-r(:,i))*DeltaT +... 

Factor.*r(:,i).°Exponent*sqDeltaT.*dW) ; 
Balance=max (0, Balance+Balance* (0.1+0.1/0.035*r(1,i+1)+... 
0.05/0.04*r(2,i+1)+0.1/sqDeltaT*randn (1) ) *DeltaT) ; 


end 
rep=Balance; 
end 
end 
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14.5.3 Application: VaR of a Portfolio Composed of One Domestic Stock and One 
Foreign Stock 


We consider a Canadian investor investing CAD 100000 in two market indices in equal 
weights on 25 January 2006. The indices are the Toronto stock exchange composite index 
S&P/TSX of the Canadian market and the EuroNext index. 

The exercise consists in computing the portfolio’s VaR for a horizon of 1 year with a 
confidence interval of 99% using Monte Carlo simulations. 

For the Monte Carlo simulations, we will use the following processes for the indices and 
the exchange rate. 


S&P/TSX Index: 
dS(t 
cS = psdt + o5VadtdZ,(t), (14.10) 
EuroNext Index: 
dS* (t 
Di us-dt + osx dtdZ2(t), (14.11) 
S*(t) 
Exchange rate EURCAD, i.e., 1 EUR = CAD x.xx 
de(t 
= = pdt + o,/dtdZs(t). (14.12) 
e 
We also assume that we have the following correlations: 
corr(d Z1, dZ2) = p12 (14.13) 
corr(d Z1, dZ3) = p13 (14.14) 
corr(d Z2, dZ3) = (23. (14.15) 


Parameters u; and o; are the instantaneous average rates (drifts) and average volatilities 
of the composite index S&P/TSX returns, the EuroNext index returns and the returns of the 
exchange rate EURCAD. Correlations p;j are the correlations between these variables returns. 
In this case, the coefficients are estimated roughly from the descriptive statistics of the data 
using the statistical mean and standard deviation of the sample. 

From daily data between 12 December 2005 and 25 January 2006 (see Table 14.3), estimate 
the different parameters for the processes using the empirical means, standard deviations and 
correlations. 

From these data, we can compute the parameters’ values. 


us = 0.1763% 
øs = 0.005909 
us = 0.2292% 
os» = 0.005295 
He = 0.0794% 


oe = 0.005754 
Piz = 0.47424 
pı3 = 0.09901 


p23 = —0.16983. 
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Table 14.3 Daily data for the indices and exchange rate 


Return Return Exchange Return 
TSX TSX EuroNext EuroN. rate e e 

25Jan06 11675.16 1466.24 1.4078 

24Jan06 11692.35 —0.15% 1452.10 0.97% 1.4147 —0.49% 
23Jan06 11733.37 —0.35% 1448.32 0.26% 1.4139 0.06% 
20Jan06 11605.67 1.10% 1450.90 —0.18% 1.3983 1.12% 
19Jan06 11692.97 —0.75% 1450.15 0.05% 1.4074 —0.65% 
18Jan06 11554.49 1.20% 1438.28 0.83% 1.4170 —0.68% 
17Jan06 11689.61 —1.16% 1445.22 —0.48% 1.4084 0.61% 
16Jan06 11720.97 —0.27% 1458.08 —0.88% 1.4030 0.38% 
13Jan06 11604.82 1.00% 1455.37 0.19% 1.4082 —0.37% 
12Jan06 11595.18 0.08% 1463.86 —0.58% 1.3987 0.68% 
11Jan06 11621.03 —0.22% 1455.24 0.59% 1.4049 —0.44% 
10Jan06 11597.61 0.20% 1444.99 0.71% 1.4056 —0.05% 
09Jan06 11565.21 0.28% 1452.29 —0.50% 1.4115 —0.42% 
06Jan06 11620.46 —0.48% 1448.21 0.28% 1.4157 —0.30% 
O5Jan06 11507.68 0.98% 1441.30 0.48% 1.4059 0.70% 
04Jan06 11501.48 0.05% 1438.04 0.23% 1.3908 1.09% 
03Jan06 11441.58 0.52% 1423.64 1.01% 1.3884 0.17% 
30Dec05 11272.26 1.50% 1401.57 1.57% 1.3763 0.88% 
29Dec05 11296.29 —0.21% 1402.99 —0.10% 1.3782 —0.14% 
28Dec05 11261.20 0.31% 1394.64 0.60% 1.3782 0.00% 
23Dec05 11245.37 0.14% 1387.96 0.48% 1.3841 —0.43% 
22Dec05 11255.39 —0.09% 1383.37 0.33% 1.3841 0.00% 
21Dec05 11247.25 0.07% 1381.86 0.11% 1.3809 0.23% 
20Dec05 11180.63 0.60% 1373.63 0.60% 1.3906 —0.70% 
19Dec05 11154.28 0.24% 1373.39 0.02% 1.4038 —0.94% 
16Dec05 11136.58 0.16% 1373.45 0.00% 1.3928 0.79% 
15Dec05 11092.81 0.39% 1368.44 0.37% 1.3882 0.33% 
14Dec05 11095.81 —0.03% 1367.46 0.07% 1.3829 0.38% 
13Dec05 11137.44 —0.37% 1370.03 —0.19% 1.3733 0.70% 
12Dec05 11099.26 0.34% 1372.60 —0.19% 1.3764 —0.23% 


Investing $50 000 in each index means that at time ¢ = 0 the investor owns 


20ND: 4986 
11675.16 
times the S&P/TSX index and 
20000 = 24.1046 


1466.24 x 1.4147 


times the EuroNext index. Here is the MATLAB program generating M value trajectories 
over a 1 year horizon and computing the VaR given by the 0.01 x M® smallest portfolio 
value. 


function VaRPortfolio(NbTraj) 
S$Function computing the portfolio’s value at risk. 
SInitial values of the indices and the exchange rate. 


294 Stochastic Simulation and Applications in Finance 


STSX = 11675.16; 
SEuroNext = 1466.24; 
e=1.4147; 
$Computation of the indices proportions that we buy. 
NbTSX = 50000/STSX; 
NbEuroNext = 50000/ (e*SEuroNext) ; 
$250 working days per year 
NbSteps=250; 
DeltaT=1; 
SCholesky decomposition of the covariance matrix. 
L=chol ([1,0.47424,0.09901;0.47424,1,-0.16983;0.09901,-0.16983,1])’; 
Vector keeping the simulated values for the portfolio. 
PortfolioValue=zeros (NbTraj,1); 
$Loop simulating the trajectories. 
for i=1:NbTraj 
SRe-initialization of the indices values and exchange rate. 
STSX = 11675.16; 
SEuroNext = 1466.24; 
e=1.4147; 
$Loop simulating the indices and exchange rates dynamic. 
for k=1:NbSteps 
Z=L*randn (3,1); 
STSX=STSX* (14+0.001763+0.005909*Z(1,1)); 
SEuroNext = SEuroNext* (1+0.002292+0.005295*Z(2,1)); 
e = e* (14+0.000794+0.005754*Z(3,1)); 
end 
sComputation of the portfolio’s final value. 
PortfolioValue (i,1) =NbTSX*STSX+NbEuroNext * SEuroNext*e; 
end 
Sorting of the final value of the portfolio and computation of the 
Ssvalue at risk. 
PortfolioValue=sort (PortfolioValue) ; 


VaR=mean(PortfolioValue) - PortfolioValue (floor (NbTraj*0.01),1); 
disp (VaR); 
end 


After running the program, we compile in Table 14.4 the values at risk of the portfolio at 
the 99% confidence level for different values of M. 


Table 14.4 99% Value at Risk for the portfolio 


M VaR ($) Error ($) 


1000 35811 0.032 
10000 35893 0.010 
100 000 35671 0.003 


1000 000 35428 0.001 
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Notes and Complementary Readings 


For complementary readings on risk management and Value at Risk (VaR), the reader could 
refer to the following works: Alexander (1998), Crouhy, Galai and Mark (2001), Holton (2003), 
Hull (2005), Jorion (2000), Marrison (2002), Pearson (2002), Risk Publications (1997). The 
book by Dupire (1998) provides a state-of-the-art review of methods and application of Monte 
Carlo simulations to the pricing and management of risks. For VaR enhancements, we refer to 
Artzner, Delbaen, Eber and Heath (1999) and McNeil, Frey and Embrechts (2005). 

The reader could consult the following standard papers on the VaR: Linsmeier and Pearson 
(2000), Duffie and Pan (1997) and Ait Sahalia and Lo (2000). 

The reader could also refer to the following web site for recent papers about VaR: 
http://www.gloriamundi.org/. 

For additional references on alternative risk transfer, the management of risk in insurance 
companies and notions of integrated risk management, we suggest the books by Banks (2004), 
Doherty (2000) and Lane (2002). 

For more information about the accords and the regulatory standards of Basle, the reader 
is invited to read the Basle documents (1996, 2004), and visit the web site of the institution: 
www.bis.org. For commercial models and documents on application systems, the reader should 
visit the web sites and download papers from KMV-Moodys, J.P. Morgan, Credit Suisse, 
Morgan Stanley, Algorithmics, BNP Paribas, Société Générale, etc. 
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Value at Risk (VaR) and 
_____ Principal Components Analysis (PCA) 


In the previous chapter, we presented basic concepts on the Value at Risk (VaR) which is a 
widely used tool to measure and manage risks. In most cases, many factors influence our risk 
variables, which causes the computation of the VaR to be very complex, tedious and time 
consuming. 

However, it is possible to combine the VaR computation method with other techniques such 
as the principal components analysis (PCA) and the Quasi Monte Carlo simulations (QMC), 
which is the subject of this chapter. The principal components analysis technique is based on 
notions of eigenvalues already presented in Chapter 4 while the Quasi Monte Carlo simulations 
method has been developed in Chapter 6. When appropriately used, these two methods (PCA 
and QMC) substantially reduce both the number of variables to analyze and the computation 
time and increase the precision. This chapter is an adaptation of the Lai, Sakni and Soumaré 
(2005) article. 

We also use real data to derive the term structure of interest rates using the linear interpolation 
technique. Note that to analyze and measure the risk of a portfolio containing bonds (fixed 
income assets), one resorts extensively to the term structure of interest rates. 

The remainder of this chapter is structured as follows. In the first part, we introduce the 
principal components analysis technique. This introduction is carried out by way of graphical 
and analytical illustrations. Then we illustrate its use with real data. In the second part, we 
apply the PCA, Monte Carlo and Quasi Monte Carlo simulations techniques to the estimation 
of a bond portfolio’s VaR. Finally, the results of the PCA, the interest rates interpolation and the 
comparison of the VaR computations using Monte Carlo and Quasi Monte Carlo simulations 
are presented and analyzed. 


15.1 INTRODUCTION TO THE 
PRINCIPAL COMPONENTS ANALYSIS 


This section presents a brief description of the principal components analysis using simple 
graphical and analytical illustrations. At the end of this section, we illustrate the use of the 
technique by data. 


15.1.1 Graphical Illustration 


To illustrate schematically the principal components analysis (PCA) technique, let’s consider 
two variables: X; and X2. The principle remains the same for more than two variables. 
Figure 15.1 plots the data points in the X1-X2 plane. 

Obviously, X; and X% are correlated. However, it is possible to rotate the lines in order to 
form two independent variables as shown in Figure 15.2. 
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X2 


Figure 15.1 Data points in the two dimensional space (X1, X2) 


Figure 15.2 Data points with rotation of original axis 


Yı and Y% are independent variables by construction and most of the data are clustered 
around the Y; axis. Two objectives have been met here. The two new axes corresponding to 
the components are not correlated. Moreover, the concentration on axis Y; is much stronger 
than the one on axis Y2, and we can therefore say that the components are ordered. Thus 
Y, explains more of the variations in the data than Y2. Therefore, Yı is the first principal 
component and Y, represents the second one. This is illustrated more clearly in Figure 15.3 in 
the Y,-Y>2 plane. 


15.1.2 Analytical Illustration 


This part formalizes what we have previously described. It gives an analytical and theoretical 
formalization to the PCA. 

We consider p vectors (X1, X2,..., Xp) of order n (standardized) representing the initial 
factors that explain the phenomenon under study, i.e., if the original data is represented by 
oD)” 
D; and o (D;) are respectively the sample mean and standard deviation of data D;. 


{D;,i=1,...,p}, then the standardized variable X; is obtained as follows: X; = where 
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yä 


Figure 15.3 Data points in the new two dimensional space (Y1, Y2) 


We can create new factors such as the first component for example Y, to be a linear 
combination of variables X1, X2,X3,..., Xp, (standardized) meaning 


Yı = uy X1 +X. +--+ +uUpi Xp, (15.1) 


such that the variance of Yı is maximal. The second component Y% is constructed as a linear 
combination of the same initial variables 


Yo = u42X1 + Uu2X2 +--+ Up2X p, (15.2) 


such that Y, has the maximal variance among all linear combinations uncorrelated with Y,, 
a*(Y1) > o2(Y>), and the correlation between Y; and Y3 is zero, corr(Y;, Y2) = 0. The same 
applies to all other components Y3, Y4,..., Yp, each one of them having maximal variance 
among all linear combinations of X1, X2, X3,..., Xp and are not correlated with the previous 
components. In other words, each of the components explains a maximum of the residual 
variation. 

Let X be the matrix built from standardized vectors X1, X2,..., Xp. And let 


A= x'x (15.3) 


n—1 


be the symmetric matrix of order p of the sample correlations between variables. Usually 
IAI Æ 0, except if there is perfect collinearity. Now the problem consists in finding linear 
combinations of the original vectors X1, X2,...,X,. Thus vector Y; is obtained by 


Y; = uy X1 + uyX2+-++-+UpiXp 
= XU;, (15.4) 


m 
where Ui = (Uii, UAj,-- +5 Upi) š 
The task consists of maximizing the sample variance of vector Y: 


1 
o? (Y) = — YTY 
n—-1 


-/ 1 
=uT( x™x) U 
n-li 


=U'AU, (15.5) 


300 Stochastic Simulation and Applications in Finance 


under the constraint that vector U has a norm of one, that is, U' U = 1. We must point out that, 
without the condition requiring a unit vector, there would be no solution, since one would need 
to take larger and larger elements of U to increase the variance of Y. With this normalization 
constraint U'U = 1, the Lagrangian of the above maximization problem is written as: 


L=U'AU-AU'U - 1), (15.6) 


where A is the Lagrange multiplier. The first order condition of this maximization implies: 


sa = 2AU — 2A0U 
= 0, (15.7) 
or simply 
(A —AIU =0. (15.8) 


This system of homogeneous linear equations has a non trivial solution (U ¥ 0) if 
|A —Al| = 0. (15.9) 


Solving the characteristic equation (A — A/)U = 0, first we obtain the eigenvalues of the 
symmetric matrix A that can be ordered the following way: A, > Az >... > Ap»; and then the 
corresponding eigenvectors U1, U2,..., U p. Eigenvectors and eigenvalues satisfy the relation 


(A — A,1)U; = 9, (15.10) 


or, AU; = A,U; with UU; = 1, fori =1,2..., p. 

The elements of these eigenvectors are precisely the weights of variables X1, X2,..., Xp) in 
order to build the linear combinations Y1, Y2,..., Y, called components: Y; = XU;. Since the 
ultimate goal of the principal components analysis is to reduce the problem’s dimensionality, 
we will keep a reduced number of components to explain a large portion of the variance. 

Indeed, since the vector is normalized to unity, the variance of Y; is 4;: 


o7(¥;) =U,’ AU; 
= U;' iU; 
= Ài. (15.11) 


And since 


p 
Xo = trace(A) = p, (15.12) 


i=1 


the percentage of the variance that is explained by the first component is 


At 
a (15.13) 
P 
by the first two components 
ài +À 
1A (15.14) 
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by the first three components 
Aitagtas 
p id 


(15.15) 


and so on. 


15.1.3 Illustrative Example of the PCA 


Here is a simple example that enables us to apply the PCA method. We use historical data 

over two years on American Treasury bonds for maturities of 1 year, 2 years, 3 years and 

5 years. These data, shown in Table 15.1, will be indexed by D1, D2, D3 and D4 respectively. 
The steps for the principal components analysis are: 


1. Enter data: Dı, D2, D3 and D4. 
2. Standardize variables Dı, D2, D3 and Da, that is: 


_ D-DD; 


j= ——., 1 = 1,2,3,4, 15.16 
o(Di) one) 


where D; and o (Dj) are respectively the sample mean and standard deviation of data Dj. 
3. Construct the covariance matrix of matrix X composed of X1, X2, X3, X4 as follows: 


1 


n—1 


A= XTX. (15.17) 


Table 15.1 American Treasury bonds yields 


Date 1 year 2 years 3 years 5 years 


10-2003 1.25 1.75 2.26 3.19 
11-2003 1.34 1.93 2.45 3.29 
12-2003 1.31 1.91 2.44 3.27 
01-2004 1.24 1.76 2.27 3.12 
02-2004 1.24 1.74 2.25 3.07 
03-2004 1.19 1.58 2.00 2.79 
04-2004 1.43 2.07 2.57 3.39 
05-2004 1.78 2.53 3.10 3.85 
06-2004 2.12 2.76 3.26 3.93 
07-2004 2.10 2.64 3.05 3.69 
08-2004 2.02 2.51 2.88 3.47 
09-2004 2.12 2.53 2.83 3.36 
10-2004 2.23 2.58 2.85 3.35 
11-2004 2.50 2.85 3.09 3.53 
12-2004 2.67 3.01 3.21 3.60 
01-2005 2.86 3.22 3.39 3.71 
02-2005 3.03 3.38 3.54 3.77 
03-2005 3.30 3.73 3.91 4.17 
04-2005 3.32 3.65 3.79 4.00 
05-2005 3:33 3.64 3.72 3.85 
06-2005 3.36 3.64 3.69 3.77 
07-2005 3.64 3.87 3.91 3.98 
08-2005 3.87 4.04 4.08 4.12 


09-2005 3.85 3.95 3.96 4.01 
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4. Compute the eigenvalues and eigenvectors of A: A; > Az > A3 > Aq corresponding to 
eigenvectors V1, V2, V3 and V4. 
5. Normalize the eigenvectors V; as follows: 


Vi 
IV: 


Vi = 1,2,3,4. (15.18) 


i= 


6. Compute the principal components: 
Y; = XU;. (15.19) 


7. Compute the proportion of the variance that is explained by the components. The proportion 
explained by the first component is 


Ay 
Ay tar +A3 + Aq’ 


(15.20) 


the proportion explained by the first two components is 


Ay + à2 
Ai tag +A3 + à’ 


(15.21) 


the proportion explained by the first three components is 


Ay + à2 + àÀ3 
Ai tag +A3 +A4’ 


(15.22) 


and the proportion explained by the four components is 


Aitastastag _ 
ài +à +à +i 


(15.23) 


The MATLAB program used to extract the components is given as follows: 


function PCA 

$Function extracting the principal components. 
SEmpirical data (can be imported from an external file). 
Rates= [1.25 , 1.75, 2.26, 3.19: ; 


3.85 , 3.95 , 3.96 , 4.01]; 

$Computation of the average and standard deviation of rates. 
Average=mean (Rates,1); 
Deviation=std (Rates) ; 

sNormalization of the rates to get a mean of zero and 
Sstandard deviation of one. 

for i=1:size(Rates,1) 

Rates(i,:) = (Rates(i,:)-Average) ./Deviation; 

end 

SInterest rates covariance matrix. 

CovRates=cov (Rates) ; 

sDetermination of the eigenvalues and eigenvectors. 
[U, Lambda] = eigs(CovRates,4) ; 

end 


Value at Risk (VaR) and Principal Components Analysis (PCA) 303 


We obtain the following eigenvectors 


0.4980 0.5443 
0.5074 0.2672 
Ui = | 0.5102 | 42 = | ~0.0439 |’ 
0.4839 —0.7940 
—0.6477 0.1903 
y, — | 04056 _ | —0.7118 
>= 1 0.5494 |’? ~*~ | 0.6603 |’ 
0.3379 0.1456 


and the following eigenvalues 


Ay = 3.8379, Az = 0.1602, A3 = 0.0017, A4 = 0.0002. 
Now, if we want to compute the principal components, we just enter the MATLAB command 


$Command for computing the principal components from the 
eigenvectors. 
Components=Rates«U; 


The percentage of the variance that is explained by the first component is 
3.8379 = 3.8379 
3.8379 + 0.1602 + 0.0017 + 0.0002 4 


by the first two components is 99.95%, by the first three components 99.995% and finally by 
the four components 100%. 


= 95.95%, 


15.2 COMPUTING THE VAR OF A BOND PORTFOLIO 


In this section, we compute the VaR of a bond portfolio using the PCA technique and the 
Monte Carlo (MC) and Quasi Monte Carlo (QMC) simulations. 


15.2.1 Sample Description and Methodology 


To illustrate the approach, we use a portfolio composed of two coupon-paying bonds with the 
characteristics shown in Table 15.2. Even though, we use only two bonds to form our portfolio, 
the methodology can be easily adapted for portfolios of more than two bonds. 

We use the Canadian market monthly spot interest rates over the period February 1996 to 
November 2003, i.e., 93 observations in total. The term structure of interest rates is represented 
by nine points defined by the spot rates of 1 month, 3 months, 6 months, | year, 2 years, 3 


years, 5 years, 10 years and 30 years. We denote by {r;, i= 1,..., 9} these initial variables 
with the subscript i being the maturity in years. For later use, the standardized values {X;, i = 
1,..., 9} of these initial spot rates are obtained as follows: 
Cee F i 
Xi = i (15.24) 
Oj 


where 7; is the sample average and ø; is the sample standard deviation of the spot rate 7;. 
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Table 15.2 Characteristics of the two coupon-paying bonds portfolio 


Bond 1 Bond 2 
Issue date 10 May 2003 10 November 2001 
Maturity 3 years 7 years 
Evaluation date 10 November 2003 
Weight 0.5 0.5 
Coupon rate 0.06 0.08 
Face value $ 1000 
Coupon payment period semi-annual 
Parameter for the QMC sequence 
1* prime number 2 
2" prime number 5 


To compute the VaR of the portfolio, we use the principal components analysis (PCA) 
technique and Quasi Monte Carlo (QMC) simulations (see Chapter 6 for the description of 
this technique) or standard Monte Carlo (MC) simulations (used extensively in the previous 


chapters). The steps are described schematically in Diagram 15.1. 
The steps described in Diagram 15.1 can be summarized as follows: 


1. Obtain the principal component factors and extract a parsimonious number of k factors. 
2. Represent the sample points in the k-dimensional cube with a uniform distribution based 
on QMC Low Discrepancy Sequences (please refer to Chapter 6 for a description of these 


sequences) or Monte Carlo pseudo-random numbers. 


Apply the PCA on the term structure of interest rates 


— 


i Construct the model with k factors 


Generate the numbers from a uniform distribution 
with the QMC LDS or MC pseudo-random numbers 


— 


z Transform the sequence into multivariate normal distribution 


Simulate N times the spot rates r, using the multivariate 
normal sample 


Compute the value of the portfolio at each simulation and 


-E construct the distribution of the changes of values 


Compute the VaR at a given confidence level 


Diagram 15.1 Description of the VaR computation steps 
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3. Transforming the random numbers with uniform distribution into a k-dimensional vector 
with multivariate Gaussian distribution, we thus obtain the sample 


Z = (Zi, Z2,..., Zo). (15.25) 
4. Simulate the spot rates using the k factors as follows: 
ri = rio +0; (Uny M Z1 +UA + + + Use Ze), (15.26) 


where r;o is the current spot rate, o; is the spot rate volatility over the VaR horizon, U;; 
represents the ith element of the eigenvector j, 4; is the jth eigenvalue. These variables are 
obtained from the principal components analysis described in the previous section. This 
last equation will give us simulated interest rates for each maturity. 

5. From these rate processes, simulate N times the value of the portfolio. Then draw the 
distribution of the portfolio value. Finally, compute the VaR from the distribution at a 
specified confidence level 1 — œ by taking the wth-percentile as described in Chapter 14. 


15.2.2 Principal Components Analysis (PCA) 


Table 15.3 gives the results from the PCA. The table shows that the first factor explains 73.32% 

of the total variance. The first and second factors explain 97.56% of the total variation of the 

variance. And finally, the first three factors explain 99.34% of the variation in the total variance. 
Since the first two components explain more than 97% of the total variance, we will then 

retain only these first two factors. Table 15.4 presents the sensitivity of the nine initial spot 

rates with respect to the first two principal components, which we name hereafter PC; and PC. 
Thus variable X; can be written as: 


Xi = wiPC; + wj2PCo, (15.27) 
where X; refers to the spot rate with maturity i. For example, for the one-month spot rate, 
Xim = 0.8480PC, — 0.4808PC2, (15.28) 
for the three-year spot rate 
X3y = 0.9742PC, + 0.1638PCo, (15.29) 


and finally for the thirty-year spot rate, 
X30y = 0.4334PC, + 0.8698PC). (15.30) 


Table 15.3 Results from the PCA 


% of the variance % of cumulative variance 

Factors Eigen values explained explained 
1 6.599 73.318 73.318 
2 2.182 24.241 97.559 
3 0.160 1.777 99.337 
4 0.045 0.498 99.835 
5 0.009 0.099 99.934 
6 0.003 0.039 99.973 
7 0.001 0.011 99.984 
8 0.001 0.009 99.993 
9 0.001 0.007 100.00 
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Table 15.4 Loadings on the first two principal components 


Components 

Variables PC, PC, 

1 month 0.8480 —0.4808 
3 months 0.8805 —0.4568 
6 months 0.9166 —0.3896 
1 year 0.9579 —0.2566 
2 years 0.9880 —0.0173 
3 years 0.9742 0.1638 
5 years 0.8822 0.4532 
10 years 0.6746 0.7316 
30 years 0.4334 0.8698 


The numbers in Table 15.4 represent the square root of the eigenvalue times the correspond- 
ing eigenvector. From the table, we observe that the first factor loadings are almost similar for 
all spot rates except for long-term maturities (10 and 30 years). Therefore, any variation of the 
first factor (PC; ) will lead to a parallel shift in the term structure. For this reason, the first factor 
is commonly called the trend component of the zero-curve. In our sample, more than 73% of 
the total change in the term structure can be attributed to the parallel shift from the first factor. 

The second factor loadings increase with the maturity of the spot rates. Thus, a positive 
shock on PC, will lead to a diminution of the short-term spot rates and an augmentation of 
the long-term spot rates. This second factor is called the tilt of the yield curve and accounts 
for almost 24% of the total variation of the yield curve in our sample. 

The third factor loadings (which we omit from our analysis) have a U-shape, i.e., the 
loadings are positive and decreasing for short-term rates, they become negative for middle 
range rates, and increase and become positive for long-term rates. The third factor, PC3, affects 
the convexity of the yield curve. 

Figure 15.4 displays the trends of the first three factors (PC;, PC2 and PC3) with respect to 
the maturity of the zero-coupon bonds. 
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Figure 15.4 Trend of the first three principal components 
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15.2.3 Linear Interpolation or Bootstrapping for the Intermediate Spot Rates 


Recall the portfolio is composed of coupon-paying bonds. The coupons are paid semi-annually. 
The initial nine spot rates are not enough to compute the price of the bonds in our portfolio 
since the intermediate spot rates are missing. We use the linear interpolation technique to 
obtain the other spot rates. 

To illustrate the methodology, consider for example a yield on a zero-coupon bond maturing 
in eight months. Suppose that we know the six-month and twelve-month spot rates. The eight- 
month spot rate is obtained with a linear combination of the six-month and twelve-month rates 
such that the two portfolios have the same duration, i.e., 


Durationg,, = x Durationg,, + (1 — x)Duration;2,, (15.31) 


with x the percentage of wealth invested in the six-month zero-coupon bond. Duration; stands 
for the duration of a zero-coupon bond with maturity i and represents the first derivative of the 
bond price with respect to the interest rate or its interest rate sensitivity coefficient. We need 
to find x. Since these are zero-coupon bonds, then 


Durations, = ; (15.32) 
and 
Duration;2,. = 1. (15.33) 
Thus, 
Durationg,, = xi +(1-—x), (15.34) 


which implies x = Z, 

The eight months spot rate is obtained with 2/3 of the six months spot rate and 1/3 of the 
one-year spot rate. We apply the same technique using the nine initial spot rates (1 month, 
3 months, 6 months, 1 year, 2 years, 3 years, 5 years, 10 years and 30 years) and obtain the 


interpolated intermediate spot rates as follows: 


1 1 1 1 

ris = z! + A r25 = 572 + z 
3 1 1 1 

r35 = 4° + g” r4 = 5/3 + z5 
1 3 9 

ras = Gra + 41s» rss = ig + T 
4 1 7 3 

re = 3/5 F 5/10, r65 = T05 + 10”! 

4 1 

rı = To’ + 9"! 75 = 7” + 5110 
2 3 3 7 

rg = 5/5 te 5710 rs = T0 + 10! 
1 1 9 

ro = zř5 + z"10, ros = —r5 + — 10 
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To sum up, we use the initial term structure of interest rates data available, i.e., spot rates of 
1, 3, 6 months, 1, 2, 3, 5, 10 and 30 years, and the other spot rates needed are obtained using 
the above linear interpolations. 


15.2.4 Computing VaR by MC and QMC Simulations 


We retain the first two factors for our analysis. Thus, the simulation equation will be: 


ri = rio + 9; (Uiz, F Uri i222) , (15.35) 


where r;o is the current spot rate, o; is the spot rate volatility over the VaR horizon, and Z; 
are distributed normally. This equation is used to generate the nine initial spot rates of the 
zero curve and the other rates are obtained by linear interpolation as described above in the 
previous sub-section. 

Here is a program used to simulate the rates. For practical purposes, the spot rates’ initial 
data will be kept in an EXCEL file and will later be converted into a text file (SpotRates.txt). 
The MATLAB program for the implementation of all the VaR computation steps described in 
this chapter is presented below. Note that the MATLAB program extracts data from a text file 
called SpotRates.txt. 


function VaR=MC (NbTraj,p,alpha) 

$Function computing the value at risk of a 
Sportfolio composed of two bonds. 

SNbTraj: Number of simulations. 

$p: Weight for each bond. 

Salpha: Confidence level 

SIdentification of our data file 
fid=fopen(’SpotRates.txt’,’rt’); 

SReading of the data from file SpotRates.txt 
Data=fscanf(fid, ‘sf %f Sf Sf Sf SE Sf SE SE’); 
Closing of source file. 


fclose (fid); 

%Partition of data into date and different rates 
M1=Data(1:9:size(Data)); %šspot rate 1 month 
M3=Data(2:9:size(Data)); %šspot rate 3 months 
M6=Data(3:9:size(Data)); *spot rate 6 months 
Y1=Data(4:9:size(Data)); spot rate 1 year 
Y2=Data(5:9:size(Data)); spot rate 2 years 
Y3=Data(6:9:size(Data)); spot rate 3 years 
Y5=Data(7:9:size( ae Sspot rate 5 years 
Y10=Data(8:9:size(Data)); %spot rate 10 years 
Y30=Data(9:9:size(Data)); %spot rate 30 years 


SMatrix of rates 
Datal=[M1,M3,M6,Y1,Y2,Y3,Y5,Y10,Y30]; 
SInitial rates 

ActualRates=Datal (size(Datal,1),:); 
sNormalization of rates 


for i=1:9 
Average (i)=mean(Datal(:,i)); 
Deviation(i)=std(Datal(:,i)); 
Datal(:,i)=(Datal(:,i)-Average(i))/Deviation(i) ; 


end 
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sThe standard deviation for a one month period. 
%We want a value at risk for 10 days and we divide by 


ssqrt (3) 


since 1 month = 310 days. 


Deviation=Deviation/sqrt (3) ; 


oe 


Eigenvectors and eigenvalues of the covariance matrix 


%Vector -Vi can also be returned by function 
seigs if Vi is an eigenvector. 
[V, E] =eigs (cov (Datal) ,3); 
sComponents determination. 


PC1=V(: 
PC2=V(: 
PC3=V(: 


,1); 
12); 
ey 


sGraph of the principal components. 


plot([1:1:9],sqrt(E 
[1:1:9],sqrt(E(3,3))*PC3,’m’); 


(1,1))*PC1,’r',[1:1:9],sqrt(E(2,2))+*PC2,’b’, 


Weightl=p; 
Weight2=1-Weight1; 
SInitial parameters 
CouponRatel1=0.06; 
FaceValuel=1000; 
CouponRate2=0.08; 
FaceValue2=1000; 
couponl=CouponRatel«FaceValuel; 
coupon2=CouponRate2«FaceValue2; 
sVectors for quasi random variables 
ul=zeros (NbTraj, 1); 
u2=zeros (NbTraj, 1); 
%Simulation of quasi random numbers 
for 1=1:NbTraj 
ul (1) =norminv (VanDerCorput (1,3) ) ; 
u2 (1) =norminv (VanDerCorput (1,5)); 


end 


SSpot rates matrix 
Rates=zeros (9,NbTraj) ; 
sLoop generating the interest rates 
for j=1:NbTraj 
for i=1:9 


end 
end 


$Computation of rates. We multiply by the sign of the first 
selement of the components to fix them positively. This will 
allow 
Sus to replicate perfectly our results if needed. 
Rates (i,j)=ActualRates(1,i)+... 
Deviation (i) (sqrt (E(1,1))+*ul(j,1)*«V(i,1)*sign(V(1,1))+... 
sqrt (E(2,2))*u2(j,1)*V(i,2) «sign(V(1,2))); 


%Spot rates initial values 


M6=ActualRates(1,3); %spot rate 6 months 
Y1l=ActualRates(1,4); %spot rate 1 year 

Y2=ActualRates(1,5); %spot rate 2 years 
Y3=ActualRates(1,6); %spot rate 3 years 
Y5=ActualRates(1,7); %spot rate 5 years 
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$Linear interpolation 


Y1_5=(Y1+Y2) /2; $spot rate 1.5 years 
Y¥2_5=(Y2+Y3) /2; S$spot rate 2.5 years 
Y¥3_5=(3%Y3+Y5) /4; S$spot rate 3.5 years 
Y4=(Y3+Y5) /2; Sspot rate 4 years 

Y4_5=(3*Y5+Y3) /4; S$spot rate 4.5 years 


sDetermination of initial ponds" prices. 


pricel=(coupon1/2)/(1+(M6/100))^(1/2)+(coupon1/2)/(1+(Y1/100)) 


(coupon1/2)/(1+(Y1-5/100))^(3/2)+ (coupon1/2) / (1+ (¥2/100) 
(FaceValuel+ (couponl1/2) ) / (1+ (¥2_5/100))* (5/2); 
price2=(coupon2/2) / (1+ (M6/100) )* (1/2) + (coupon2/2) / (1+ (Y1/100) 


(coupon2/2) / (1+ (¥1_5/100 


( 

(coupon2/2) / (1+ (¥3_5/100 

(coupon2/2) / (1+ (Y4_5/100))*(9/2)+. 

(FaceValue2+ (coupon2/2)) / (1+ (¥5/100) ) 
sPortfolio’s initial price. 
InitialPrice=Weight1lspricel+Weight2s*price2; 


))* (3/2) + (coupon2/2) / (1+ (Y¥2/100) ) 
coupon2/2) / (1+ (¥2_-5/100) ) * (5/2) + (coupon2/2) / (1+ (¥3/100) ) 
))* (7/2) + (coupon2/2) / (1+ (Y4/100) ) 
)) ) 


sLoop computing the possible evolution of the portfolio's price. 


for k=1:NbTraj 

M6=Rates (3,k); $Spot rate 6 months 
Y1=Rates (4,k); SSpot rate 1 year 
Y2=Rates(5,k); SSpot rate 2 years 
) 3 
5 


Y3=Rates(6,k); SSpot rate years 

Y5=Rates(7,k); SSpot rate years 

Y1_5=(Y¥14+Y2) /2; SSpot rate 1.5 years 

Y2_5=(Y¥2+Y3) /2; SSpot rate 2.5 years 

Y3_5=(3«Y3+Y5) /4; S$Spot rate 3.5 years 

Y4=(Y¥3+Y5) /2; %Spot rate 4 years 

Y4_5=(3*Y5+Y3) /4; SSpot rate 4.5 years 

SValuation of two different ponds 

pricel=(coupon1/2)/(1+(M6/100))^(1/2)+(coupon1/2)/(1+(Y1/100) 
(coupon1/2)/(1+(Y1-5/100))^(3/2)+ o / (1+ (Y2/100)) 


(FaceValuel+ (coupon1/2) ) )/ + Y2_5/100))^(5/2) 


price2=(coupon2/2)/(1+(M6/100))^(1/2)+(coupon2/2)/( aaa Aoo 


coupon2/2)/(1+(Y1-5/100 


( 
coupon2/2)/(1+(Y3-5/100 
coupon2/2)/(1+(Y4-5/100 (9/2 

(FaceValue2+ (coupon2/2)) / (1+ (¥5/100)) 
%Computation of the portfolio's price for this scenario 
PortfolioPrice=Weightl»pricel+Weight2»xprice2; 
vectPrice(k,1)= PortfolioPrice; 
end 
%šSorting the prices and VaR computation. 
vectPrice=sort (vectPrice); 

VaR= (InitialPrice-vectPrice (floor (alpha/100xNbTraj))); 
end 


function rep=VanDerCorput (n, b) 


(3/2) + (coupon2/2)/(1+(Y2/100)) 


))* ) 

coupon2/2) /(1+(¥2-5/100) ) * (5/2) + (coupon2/2) / (1+ (Y¥3/100) 
))* (7/2) + (coupon2/2) / (1+ (Y4/100) ) 
Be )+ 


Function generating the Van Der Corput sequence to build Halton 


sequence. 
sn: Index of the sequence of elements. 
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sb: Basis for decomposition. 

bn=0; 

j=0; 

while n~=0 
bn=bn+mod (n,b) /b* (j+1) ; 
n=floor(n/b) ; 
j=j+1; 

end 

rep=bn; 

end 


Table 15.5 compares the results for the portfolio VaR from MC and QMC simulations with 
respect to the number of simulations N and the confidence level (99% (œ = 1%), 95% (a = 
5%) or 90% (a = 10%)). It illustrates the convergence pattern for each simulation technique 
as the number of simulations increases. The standard deviation, o, is used to measure the 
convergence and stationarity of the VaR for different numbers of simulations. For the QMC 
simulations, we use the Halton sequence. 

For a confidence level of 99%, the standard deviation with MC is 1.491 and with QMC it 
is 0.125. Thus, the error of the VaR is on average 11.891 times higher with MC compared to 
QMC. With a confidence level of 95%, the standard deviation with MC is 0.648 and 0.151 
with QMC. In this case, MC standard deviation is 4.304 times higher than that of QMC. With 
a 90% confidence level, the standard deviation obtained with MC is 0.274 and with QMC 
it is 0.205. The standard deviation of the VaR with MC is 1.335 times higher than that with 
QMC. These findings are illustrated in Figures 15.5, 15.6 and 15.7. The graphs show that 


Table 15.5 Portfolio VaR with MC and QMC 


Portfolio VaR with MC and QMC ($) 


MC QMC MC QMC MC QMC 


N 10% 10% 5% 5% 1% 1% 
500 21.46 21.16 26.22 28.38 34.53 39.43 
800 22.25 21.37 27.84 28.28 41.31 39.43 
1000 21.94 21.69 29.29 28.00 39.24 39.43 
3000 21.74 21.74 28.04 27.92 37.81 39.22 
5000 22.43 21.85 28.45 28.03 39.75 39.43 
8000 21.73 21.85 27.84 28.00 38.71 39.43 
10 000 21.45 21.76 27.40 28.00 37.98 39.43 
30.000 22.03 21.84 28.07 27.91 39.62 39.21 
50.000 22.04 21.82 28.09 27.92 39.27 39.17 
80.000 21.76 21.83 27.96 27.91 39.38 39.22 
100 000 21.83 21.83 203k 27.91 39.11 39.22 
150 000 21.63 21.82 27.73 27.90 39.00 39.17 
200 000 21.80 21.82 27.81 27.90 39.16 39.17 
250 000 21.78 21.82 27.80 27.89 39.20 39.16 
o 0.274 0.205 0.648 0.151 1.491 0.125 
Ratio 2¥¢ 1.335 4.304 11.891 
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Figure 15.5 Convergence of simulations, 99% (œ = 1%) 
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Figure 15.6 Convergence of simulations, 95% (œ = 5%) 
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Figure 15.7 Convergence of simulations, 90% (a = 10%) 
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QMC converges quickly after 5000 simulations, with MC, the same results are obtained only 
after 150000 simulations. 


Notes and Complementary Readings 


For further readings on the use of factor models in bond portfolio management, we refer the 
reader to papers by Litterman and Scheinkman (1991), Barber and Copper (1996), Kreinin, 
Merkoulovitch, Rosen and Zerbs (1998), Lai, Sakni and Soumaré (2005). 

For a complementary reference on principal components analysis, consult the book by 
Dionne (2002). 
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Appendix A 


Review of Mathematics 


A.1 MATRICES 


A matrix of order (n, m) is a rectangular array of n x m elements placed in n rows and m 
columns as follows: 


4il 412 aim 
a21 a22 sts dm 

A=| (A.1) 
ani an2 *** Anm 


and we denote by A = (aij)nxm, 1 < i < n, 1 < j < mand (A); = aj is the element of matrix A 
at the intersection of row i and column j. In the following we suppose aj is a real number. 

A matrix of order (n, m) is also called an n by m matrix or n x m matrix. When n = m, the 
matrix is a square matrix of order n. For such a matrix, elements 411, 422 ,..., Ann form its 
principal diagonal. 


Definition 1.1 Two matrices A = (ajj)nxm and B = (bij)pxq are equal if and only if: 


e They have the same order (that is n = p and m = q) and 
eaj=—bj) Vij, l<i<n=p,l<j<m=q. 


Example 1.2 The null matrix of order (2, 3) noted 02x3 is the matrix 


0 {9 0 0 
x= o 0 0?’ 


while an order 3 identity square matrix is 


100 
bkb=|0 1 0}, 
001 


matrix such that the elements on the principal diagonal are all equal to 1 and the elements 
off diagonal are zero. Generally, we can define a null matrix n by m and identity matrices 
of order n, where m and n are any integer number. 
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A.1.1 Elementary Operations on Matrices 


The following elementary operations on matrices are summarized. 


Multiplication by a Scalar 


Let a be a real number and A = (a;ij)nxm a matrix n x m. The product A by a, noted gA , is a 
matrix n by m obtained by multiplying each element of A by q, that is 


aA = (aaj;) (A.2) 


nxm ` 
Addition and Substraction of Matrices 


Let A = (aij)nxm and B = (bij)nxm be two n by m matrices. The sum of A and B denoted by 
A+B is a matrix n by m whose element of row i and column j is obtained by summing elements 
of row i and column j of A and B. Then, we have A + B = (aj + bij)nxm Or equivalently 


(A + B);; = aij + bij. (A.3) 
The substraction of B from A is the matrix 


A — B = A + (-1)B = (aij — bij) (A.4) 


nxm ` 
Transpose of a Matrix 


The transpose of an order (m, n) matrix A, denoted by A! (or A’), is the matrix m by n whose 
element of row i and column j is the element of row j and column i of A: 


(A"),, = aji, l<i<n, 1<j<m. (A.5) 


Product of Two Matrices 


Let A = (@jj)nxm and B = (bij)mxq be two matrices. The product of A by B, noted AB is the 
matrix n by q whose element of row i and column j is given by: 


m 


(AB); = 5 aikbkj. (A.6) 
k=l 


Product of Two Matrices Element by Element 


For some computations, it can be important to define an elementwise product of two matrices 
A = (ij)nxm and B = (bjj)nxm denoted by A © B, with: 


(A © B);ij = aijbij. (A.7) 


To define such product, the matrices must have the same order. 
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A.1.2 Vectors 


A row vector X of order n is a 1 by n matrix, and a column vector Y of order n is an by 1 
matrix: 


X = (X1,X2,...,Xn) (A.8) 
and 
yı 
y2 
Y=] . |. (A.9) 
Yn 


Unless specified, a vector of order n will refer to a column vector of order n. 
We define the usual scalar product of two vectors X and Y of order n, the real number 


xXly = X1 Y1 + X2y2 + + + XnYn =$ xy. (A.10) 
k=1 


The norm of X is the real number denoted by IIXII with 
IXI =VXTX = |) ox}. (A.11) 
k=1 


An n by m matrix is thus an array of n row vectors of order m, or equivalently an array of 
m column vectors of order n. 


A.1.3 Properties 
P1. Product of Two Matrices 


Let A = (aij)nxm and B = (bjj)mxq be two matrices. The product of A by B, denoted by AB is 
the n by q matrix whose element of row i and column j is obtained by computing the scalar 
product of row i of matrix A by column j of matrix B. We have 


m 


(AB); = D> airbrj. (A.12) 
k=1 
P2. Inverse of a Matrix 


A matrix A = (4jj)nxn İs said to be invertible if there exists a matrix B such that AB = BA = 
I, where J is the identity matrix of order n. In this case, B is the inverse matrix of A, and is 
denoted by A7 !. 


P3. Triangular Matrix 
A square matrix of order n, 


A= (aij) en (A.13) 
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is said to be upper triangular if a;; = O for i > j; this is the case of the matrix 


1 5 6 
A=]0 2 9 
0 0 3 


A matrix will be called lower triangular if aj; = 0 for i < j, this is the case of matrix 


1 0 0 
B=]5 2 0 
6 9 3 


A square matrix is called triangular if it is lower triangular or upper triangular. 


P4. Diagonal Matrix 


A square matrix that is upper triangular and lower triangular is a diagonal matrix. This is the 
case of matrix 


4 0 0 
D=]0 3 0 
0 0 7 


PS. Symmetric Matrix, Positive and Definite Positive Matrix 


A square matrix A of order n is symmetric if A = A'. It is positive if X 'AX > 0 for all vector 
X of order n, and is called positive definite if X' AX > 0 for all vector X of order n different of 
zero (X Æ 0,1). 


A.1.4 Determinants of Matrices 


We associate to each square matrix A a real number called determinant and denoted by IAI 
which has many applications as we will see below. 
The determinant of a square matrix of order 2 x 2 noted 


a a 
Aes oe SR (A.14) 
a2; a2 
; ; ; T2- 
is IAl = a11a22 — a12a21. The determinant of matrix 3 4 is 


1 2 
3 4 


=1x4-2x3=4-6=-2. 


Definition 1.3 Let A = (aij)nxn be a square matrix of order n. We associate to each element 
aj of A, a sub-matrix of order (n — 1) x (n — 1) obtained by deleting row i and column j of 
A. The determinant of this sub-matrix called minor of aj constitutes Aj. 
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Thus for 
2 0 3 
A= -1 1 
0 1 2 
the minor of a); = 2 is 
A on (=1)x2-1x1= -3 
= = (— x = x =). 
11 12 
Definition 1.4 The determinant of any matrix of order n 
A= (aij) nen (A.15) 
is the real number 
|Al = OD aij Aj. (A.16) 
j=l 
Equivalently, we have 
|A| = SoD aij Ai (A.17) 
i=l 


independently of row i or column j used. 


For example, if 


2 0 3 
A=]4 -1 1], 
0 1 2 
then 
[A] = X- 1) An + O(-1)'P Ap + 30-1) A 
= 2(-1)'t1 Ay + 4(-1)"*1 Aas + O(-17*1 A3, 
thus 
-1 1 4 -1 
A| =2 + 0(-1)!*7Ap +3 
|A| 12 (—1) 12 ei 
= 2(—-2 — 1) + 3(4) 
= —6 + 12 


= 6. 
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Properties of the Determinant 


1. The determinant of a matrix with one null column or row is equal to zero. 
2. The determinant of matrix B obtained from matrix A by multiplying a row or column by a 
scalar œ is 


|B| =a|A|. (A.18) 
It follows that the determinant of matrix œA is 
laA| =a" |A], (A.19) 


where n is the dimension of matrix A. 

3. The determinant of a square matrix of order n and upper or lower triangular, is the product 
of its elements on the main diagonal. 

4. A matrix is invertible if and only if its determinant is different from zero. 


A.2 SOLUTION OF A SYSTEM OF LINEAR EQUATIONS 


A system of linear equations is an equation of the form AX = B, where A is ann by m matrix, 
X is a column vector of order m and B is a column vector of order n. 


Example 2.1 Consider the system of linear equations 
x—-—3y=1 
2x —y=3 


E 


equivalent to AX = B, where 


and 


For such a system, we look for the vector X for which AX = B. In the case of an invertible 
square matrix A, it is easy to see that X = A~'B, since by multiplying by the inverse of A we 
have 


A 'AX = A7'B, (A.20) 


and since A~'A = J, we obtain X = A7!B. 
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Case of an Upper Triangular Matrix 


Consider the following linear system: 


ay, 412 Ain X1 bi 

0 a22 dn X2 bz 
OS a g oa cd (ee (A.21) 

O > O ann Xn bn 


To solve this system, we just need to note that the last component of X is x, = Pn, and the 
penultimate row of the system gives us 


pS er (Dy-1 = an—1nXn) 
Qn—-1n-1 
1 bn 
= Dn—1 — Gn—1n : (A.22) 
dn—1 n—1 Ann 


By going upwards, we can completely determine the vector X. Here is the principle. We begin 
by computing x,,. Once the value of x, is known, we compute x,_; as a function of x, and we 


repeat this step. Knowing x,,..., 441, we compute x, as a function of X41, Xk+2, - - - , Xn USING 
row k of the system. Thus, we obtain: 
b 
ae (A.23) 
Ann 
1 
Ap) = (Dn-1 — an-ı nXn) ’ (A.24) 
An—-1n-1 


1 n 
Xe = — (r eD as) ; (A.25) 


fork =1,...,n—2. 
This strategy can be used in the case of a lower triangular matrix, but one must compute 
from top to bottom. Beginning with xı, we obtain the following solution: 


by 


a aan (A.26) 
a1 
1 
X2 = — (bz — az 1x1), (A.27) 
a22 


k-1 
1 
xk = — (r - Yas (A.28) 


B= 332.05: 
As we have just seen, it is easy to solve a linear system when the matrix is upper or lower 
triangular. Thus it can be interesting to find a triangular matrix T, and an appropriate second 
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vector C such that the new system of linear equations TY = C has exactly the same solutions 
as the initial system AX = B. Many approaches to performing this operation are available in 
the literature. 


A.3 MATRIX DECOMPOSITION 
LU Decomposition 


Sometimes it is useful to decompose a square matrix A in a product of two square matrices L 
and U such that A = LU, where U is upper triangular and L is unit lower triangular. A triangular 
matrix is unit if all the entries on the main diagonal are 1. This decomposition enables us to 
easily solve the system of linear equations AX = B by solving the equivalent following system 
LUX = B that can be decomposed into two systems 


LY = B 


A.29 
UX =Y. ( ) 


We solve the first system with the triangular matrix to get Y and knowing Y we can solve the 
second system to find X. We have AX = LUX = LY = B. The Gauss elimination method is 
one of the techniques used to perform a LU decomposition of any square matrix A. For more 
details, we refer the reader to any good linear algebra book. 


Cholesky Decomposition 


If matrix A is symmetric and invertible, then it has a decomposition A = LDU, where L is unit 
lower triangular, D is a diagonal matrix and U is unit upper triangular with U = L' . Moreover, 
if all elements on the diagonal of D are positive, then A has a decomposition R' R, where R is 
an invertible upper triangular matrix. 

However, in practice, to find matrices U and L, a Gauss decomposition method or Crout 
method are often used. 


A.4 POLYNOMIAL AND LINEAR APPROXIMATION 


Definition 4.1 A polynomial P of degree n is a function from R into R with the form 


P(x) = ap +ayx! + ax? +- -an x” 


i=0 


Definition 4.2 The Taylor series of degree n to approximate (in a neighborhood of xo) the 

real function f(x) is the degree n polynomial in which all derivatives of order k (P“), k < n, 
are such that 

P(xq) = fO xo) and (A.31) 

Po) = fo). (A.32) 
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Hence 


P(x) = f (xo) + yy aes a oo Mx 0) + O"T’). (A.33) 


k=0 


A.5 EIGENVECTORS AND EIGENVALUES OF A MATRIX 


Definition 5.1 Let A be a square matrix of order n, the real number À is an eigenvalue of A if 
there exists a vector U 4 Onax; of order n (called eigenvector associated to eigenvalue à) such 
that AU = XU or equivalently 


(A — ìl U = Onx1. (A.34) 


This is equivalent to (A — XI) not invertible, or equivalently that its determinant is zero. 


Thus in order to find the eigenvalues of a matrix A, we can solve the equation IA — AJ, = 
0, and knowing À solve the equation (A — A/,)U = 0,1 to get U. Polynomial 


P(A)=|A—Al,| (A.35) 


is called characteristic polynomial of A, and its roots are the eigenvalues of A. 


Example 5.2 With an order 4 matrix, 


ke i Sh 2 
2 2 1 
A= , 
3 2 3 6 
= =]. 2 54 
we note 
a. 1 =i 2 
z gan, 3 1 
B=A-Al, = 

3 2 Bai. 6 
=i ve Js CA 


We compute the determinant of B = A — AI, to obtain the characteristic polynomial of 
A: 
|A —Al,| = 7+ 25a — 4° — 2527 + 14. 
The roots of this polynomial are the eigenvalues of A. It is not always easy to find the exact 


values of the roots of this polynomial, so in such cases we proceed by approximation. With 
MATLAB, we obtain 4; = —4.9764, à2 = 4.9643, A3 = —0.2284 and A4 = 1.2405. The 
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corresponding eigenvectors are 


0.3675 0.0297 
—0.1554 —0.4628 
1= | 0.4857 |° >T | —0.8737 
—0.7777 —0.1466 
—0.5608 —0.6586 
0.8243 —0.5062 
>| 0.0707 |> 47 | 0.4086 


—0.0324 0.3782 
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bitshift(a, k) 


bitxor(a, b) 
cat(dim, A, B) 


ceil(x) 

chol(A) 

cov(X, X) 
cumsum(A, dim) 
disp(a), disp(‘text’) 
doc function 
eigs(A, k) 


floor(a) 
fclose(...) 
figure 


fopen(‘files.txt’) 
fscanf(...) 


fsolve(@myF unction,x0) 


hold on... hold off 


inv(A) 
legend(‘Series 1’, 
‘Series 2’, ...) 
lu(A) 


max(A, [], dim) 
mean(A, dim) 


mod(a, b) 
nchoosek(n, k) 


normcdf (x) 
norminv(x) 


normpdf(x) 
ones(m, n) 


Appendix B 
MATLAB® Functions 


Transpose of matrix A. 
Product of 2 matrices or vectors, elementwise. Equivalent to the scalar 
product of 2 vectors. 
Logical OR operation between a and b. We can also use OR(a, b). 
~ is the logical NOT operator. We can also use NOT(a). 
Verify if two expressions are equivalent. 
Verify if two expressions are different. 
Shifts of k bits of integer number a. It is equivalent to multiply by 2* and 
take the integer value of the result. 
Performs the EXCLUSIVE OR operation between numbers a and b. 
Function that concatenates matrix B with matrix A according to the 
specified dimension dim. 
Returns the smallest integer greater or equal to x. 
Cholesky decomposition. Returns matrix L such that L'L = A. 
Function that computes the covariance between vectors X and Y. 
Computes the cumulative sum of elements of A following dimension dim. 
Displays the value of variable a or prints a text string. 
Displays the MATLAB documentation on the command function. 
Called with outputs [V, D] = eigs(A, k) returns the k greatest eigenvalues 
of A (ordered in matrix D) and the normalized eigenvectors associated 
(in matrix V). 
Returns the integer value of a. 
Closes the file opened by function fopen. 
Creates a second graphical environment to display many figures in the 
same program. 
Open file.txt for reading inputs in the file. 
Extracts data stored in a file opened with function fopen. 
Solves the following non linear system: myFunction(x) = 0. 
Between these bounds, the curves plotted with command plot are added 
on the same graph. 
Computes the inverse matrix of matrix A. 
Creates a legend in the last graph and names the first curve Series 1, the 
second curve Series 2,... 
LU decomposition of matrix A. 
Returns a vector containing maximal elements following the selected 
dimension (max(X) returns the maximal element of vector X). 
Computes the mean of the elements in matrix A following the selected 
dimension. 
Returns the value a modulo b. 
Computes the number of possibilities to draw k elements from a set of 
n elements. 
Gives the cumulative distribution function of the normal distribution at 
point x. 
Gives the inverse of the cumulative distribution function of the normal 
distribution at point x. 
Gives the probability density function of the normal distribution at point x. 
Initializes a matrix m x n with all its elements equal to 1. 

(continued ) 
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plot(A, B) 


randn(m, n) 
rand(m, n) 


repmat(A, m, n) 


size(A, dim) 
sort(X) 
sum(A, dim) 
tic... toc 


tinv(x) 
zeros(m, n) 


Plots the curves defined by rows of A and B. For example, the first curve 
goes through coordinates (A(1, i), BC, i)). 

Generates a matrix m x n of standard Gaussian random variables. 
Generates a matrix m x n of uniform random variables over the interval 
[0, 1]. 

Creates a matrix composed of many copies of matrix A m rows by n 
columns. 

Gives the dimension of matrix A following dimension dim. 

Sorts elements of vector X in increasing order. 

Computes the sum of elements of matrix A following the dimension dim. 
Returns the time needed to perform operations between bounds defined 
by tic and toc. 

Inverse function of the Student cumulative distribution function. 
Initializes a matrix m x n with all elements equal to 0. 


Stochastic Simulation and Applications in Finance with MATLAB? Programs 
by Huu Tue Huynh, Van Son Lai and Issouf Soumaré 
Copyright © 2008, John Wiley & Sons Ltd. 
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